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Abstr act 


The  goal  of  this  thesis  is  to  find  "good"  scheduling  rules 
for  computer  systems  that  have  more  than  one  scarce  resource,  and 
to  investigate  program  interactions  in  such  systems.  The 
approach  involves  the  analysis  of  yet  another  queueing  network 
model . 


The  basic  model  includes  two  processors,  which  m 
of  as  a  CPU  and  an  I/O  channel,  and  two  program  clas 
distinct  burst  time  characteristics.  Three  types  of 
functions  are  considered:  total  processor  utilizati 
run  time  expansion  factor,  and  various  rewards  for  p 
completion.  A  scheduling  discipline  optimal  with  re 
first  of  these  measures  is  defined. 

No  claim  is  made  that  the  model  is  entirely  real 
the  results  are  immediately  applicable  to  actual  com 
systems.  Rather,  it  is  hoped  that  a  solid  understan 
interaction  effects  that  arise  in  this  relatively  si 
will  furnish  a  basis  for  the  formal  and  intuitive  co 
of  more  complex  environments. 
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The  thesis  includes  a  survey  of  various  heuristic  approaches 
to  the  schedulinc  of  multiple-resource  computer  systems.  The 
more  sophisticated  queueing  network  models  in  the  literature  are 
also  discussed  in  some  detail.  An  annotated  bibliography  of  the 
subject  area  is  provided. 


-11- 


T 

would 

like 

to  thank 

for  h 

is  pat 

ience. 

support 

my 

sa 

cond  reader. 

Pro  f  ess 

su 

gge 

stions 

and  h 

is  conti 

Un 

ive 

rsity  ' 

of  Tor 

onto  pro 

in 

Ap 

pendix 

A.  J 

ohn  Gutt 

FI 

ain 

e  Hich 

of  Ca 

rnegie-M 

th 

sir 

careful  rea 

dings  an 

my  supervisor 
,  guidance  and 
or  J.J.  Hornin 
nuing  interest 
vided  generous 
ag  o  f  the  Univ 
ellon  TJniversi 
d  helpful  comm 


f  Professor  K.C.  Sevcik, 
general  good  humor,  and 
g,  for  his  many  valuable 
.  Michael  Bauer  of  the 
assistance  with  the  proof 
ersity  of  Toronto  and 
ty  deserve  mention  for 
ents. 


For  friendship  and  encouragement  during  a  period  when  I 
needed  both  very  badly,  I  owe  many  persons  in  and  around  the 
Department  of  Computer  Science  a  debt  that  I  can  never  adequately 
repay . 


Financial  support  was  gratefully  received  from  the  University 
of  Toronto  and  from  the  Department  of  Computer  Science. 


"Forsan  et  haec  olim  raeminisse  iuvabit 

[Virgil  2 SBC] 


II 


-111- 


Contents 


?^bstract 


i 


A.C  knowledge  me  nts 


ii 


Contents  iii 


1,  Introduction  1 

1.1  Preliminary  definitions 

1.2  Objective  functions 

1.3  Service  time  information 

1.4  Optimal  scheduling  rules 


2.  Heuristic  approaches  12 

2.1  The  balanced  set  principle 

2.2  Feedback  scheduling 

2.3  Multi-level  scheduling 

2.4  Mahl’s  work 

2.5  Experiments  in  m ultiple- resource  scheduling 


3.  Queueing  network  models  20 

3.1  Mitrani’s  cyclic  queue  model 

3.2  Euzen’s  central  server  model 

3.3  Baskett’s  sequence  of  network  models 

3.4  The  general  model  of  Muntz  and  Baskett 

3.5  Anderson’s  model  with  N  distinguishable  tasks 

3.6  Omahen’s  multiple-resource  queues 


4.  Systems  with  two  processors  and  two  programs  36 

4  .  1  The  model 

4.2  State-transition  representations 

4.3  Balance  equation  solution 


5.  Total  processor  utilization  as  an  objective  function  45 

5.1  FCFS  versus  the  priority  disciplines 

5.2  Service  rate  biasing  versus  processor  sharing 

5.3  Full  priority  schedules 

5.4  ’’Good"  full  priority  scheduling  rules 


IV- 


6.  Program-oriented  objective  functions  77 

6.1  Processor  sharing  and  blowup  factors 

6.2  Schedules  that  impact  a  particular  program  class 

6.3  General  observations 

6.4  When  programs  cycle  finitely  in  the  system 

7.  Conclusions  and  extensions  92 

A.nnotated  bibliography  95 

A,ppendix  P.:  A  proof  that  some  pair  of  full  priority  109 

schedules  is  sufficient  to  achieve  maximum  total 
processor  utilization 

Appendix  B:  A  proof  of  the  optimality  of  the  SBT-PSBT  113 
discipline 


1 


^  ntrodnct ion 


_1_ 


A  computer  system  consists  of  a  number  of  resources  (or 
facilities)  which  must  be  allocated  to  the  programs  (or  users,  or 
processes)  that  request  them.  This  general  resource  allocation 
problem  may  be  referred  to  as  scheduling.  While  viewing 
scheduling  as  the  center  of  the  universe  unfairly  relegates  to 
subproblem  status  a  number  of  independently  interesting  topics 
(e.g. ,  the  deadlock  problem),  it  provides  a  convenient  frame  of 
reference . 

Resources  may  be  considered  to  fall  into  two  classes:  active 
(e.g.,  processors  and  channels)  and  £assiye  (e.g.,  memory). 
Generally,  a  program  possesses  at  least  one  active  resource  at 
any  time  it  is  progressing,  but  it  may  simultaneously  require 
many  different  passive  resources.  The  problems  considered  in 
this  thesis  involve  the  allocation  of  active  resources,  which 
will  be  treated  uniformly. 


This  thesis  is  concerned  with  scheduling  so  as  to  make 
efficient  use  of  systems  that  contain  more  than  one  scarce 
resource.  Although  this  description  includes  most  contemporary 
computer  systems,  lit-ble  work  has  been  reported  concerning  the 
formal  analysis  of  such  systems.  Most  studies  fall  into  one  of 
three  categories.  First,  certain  scheduling  rules  have  been 
proven  optimal  for  very  restricted  systems  and  environments. 
Second,  a  number  of  heuristic  guidelines  for  the  efficient 
scheduling  of  multiple-resource  systems  have  been  devised,  and 
some  of  these  have  been,  incorporated  into  actual  operating 
systems  with  a  degree  of  success.  Third,  the  performance  of 
various  models,  often  incorporating  severe  restrictions  and  a 
limited  number  of  scheduling  options,  has  been  analyzed  with 
respect  to  a  set  of  generally  inadequate  objective  functions. 


The  present  vrork  attempts  to  present  an  orderly  study  of 
systems  with  two  processors,  which  may  be  thought  of  as  a  central 
processor  and  an  I/O  channel,  and  two  active  programs  with 
different  characteristics.  The  performance  of  many  "reasonable” 
scheduling  rules  is  evaluated  with  respect  to  a  number  of 
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objective  functioris,  and  the  interactions  of  programs  with 
various  characteristics  are  explored.  A  thorough  understanding 
of  the  behavior  of  such  simple  systems  is  a  prerequisite  to  the 
comprehension  of  more  complex  ones. 

This  thesis  is  largely  self-contained.  It  does,  however, 
presume  some  familiarity  with  the  terminology  of  operating 
systems  and  probability  theory.  The  material  is  organized  in  the 
following  manner: 

The  remainder  of  Chapter  One  discusses  general  scheduling- 
related  material  (objective  functions  and  service  time 
information)  and  optimality  proofs  for  scheduling  rules  in 
systems  with  one  scarce  resource. 

Chapter  Two  is  devoted  to  a  discussion  of  the  heuristic 
principles  underlying  the  efficient  scheduling  of  multiple- 
resource  computer  systems  and  the  experimental  attempts  to  apply 
these  principles. 

Chapter  Three,  after  discussing  the  requirements  for  a  useful 
analytic  model  of  multiple-resource  systems  and  evaluating  the 
simplifying  assumptions  usually  encountered,  surveys  a  number  of 
the  more  promising  models  that  have  appeared  in  the  literature. 

Chapter  Two  and  Sections  3.1  through  3.6  may  be  omitted  with 
some  loss  in  baclcground  and  motivation.  It  is  suggested  that  the 
reader  with  limited  time  or  interest  do  so. 

In  Chapter  Four,  the  queueing  network  model  used  in  this 
thesis  is  presented  and  the  method  of  solution  is  described.  The 
behavior  of  the  model  with  respect  to  processor-oriented 
objective  functions  is  explored  in  Chapter  Five.  The  performance 
of  many  schedules  and  scheduling  rules  is  evaluated,  and  certain 
novel  disciplines  are  presented. 

Chapter  Six  is  devoted  to  program-oriented  objective 
functions,  and  Chapter  Seven  summarizes  the  results  of  the 
thesis. 


Preliminary  definitions 

Before  beginning  the  discussion,  a  few  elementary  definitions 
are  necessary: 

Definition:  A  gueue  is  a  set  of  programs  awaiting  service  at  a 

particular  processor. 

Definition:  A  schedule  indicates  which  member  of  a  gueue  should 

be  receiving  service  at  a  particular  instant  in  time. 

Definition;  A  scheduling  rule  or  discipline  is  an  algorithm 
according  to  which  a  schedule  is  selected. 

Scheduling  rules  may  be  either  preemptive  or  non-preempti ve.  In 
the  former  case,  the  preemption  criteria  are  also  specified  by 
the  scheduling  rule. 

Definition:  A  Ereemptiye  scheduling  rule  may  interrupt  a  program 

that  is  receiving  service  at  a  processor.  Execution  of  the 
program  may  later  be  resumed  at  the  point  of  interruption. 

Scheduling  decisions  as  considered  in  this  thesis  need  only 
be  made  when  a  conflict  exists;  i.e.,  when  more  than  one  program 
is  requesting  service  at  some  processor.  In  a  multiple-resource 
system,  queues  may  form  at  several  processors,  and  a  scheduling 
rule  must  be  specified  for  each.  In  other  words,  a  single 
scheduling  rule  is  not  specified  for  the  system  as  a  whole,  but 
rather  a  rule  is  specified  for  each  processor  where  contention 
may  occur. 

The  goal  of  a  scheduling  rule,  or  of  a  set  of  scheduling 
rules  in  a  multiple-resource  system,  is  to  optimize  some 
particular  objective  function. 

Definition:  A.n  objective  function  formally  specifies  the 

criteria  according  to  which  the  performance  of  various 
schedules  or  scheduling  rules  is  evaluated. 

In  attempting  this  optimization,  the  rule  may  make  use  of  various 
information  concerning  the  behavior  of  programs  and  the  nature  of 
the  objective  function.  The  two  sections  that  follow  discuss 
these  matters  in  detail.  Scheduling  rules  may  be  simple  or 
complex,  and  the  information  upon  which  they  base  decisions  may 
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be  readily  accessible  or  difficult  to  obtain.  In  examining  the 
performance  of  rules  from  a  theoretical  point  of  view,  it  is 
important  to  bear  in  mind  the  expense  that  a  particular  rule  will 
incur  in  collecting  data,  computing  a  schedule,  and  preempting 
active  programs. 


1 . 2  Objective  functions 

To  specify  the  criteria  according  to  which  scheduling  rule 
performance  will  be  evaluated  is  an  extremely  difficult  task, 
particularly  because  of  the  contradictory  nature  of  many 
seemingly  reasonable  candidates. 

Objective  functions  fall  into  two  primary  categories: 
processor-  (or  system-)  oriented,  -and  program-oriented. 

Functions  of  the  latter  type  may  be  further  subdivided  into  those 
that  consider  individual  programs,  and  the  more  common  measures 
of  mean  or  average  behavior. 


Probably  the  simplest  and  most  frequently  encountered 
computer  system  objective  function  is  central  processor 
utilization.  There  are  several  problems  with  this  measure. 

First,  in  multiple-resource  systems  it  ignores  the  utilization  of 
all  but  the  central  processor.  This  matter  will  be  dealt  with  in 
Chapter  Five.  Second,  it  fails  to  distinguish  useful  processing 
from  overhead.  [Sherman  1972a]  cites  the  following  statistics 
from  a  benchmark  jobst-ream  for  two  operating  systems  on  a  CUT 
SOO: 

UT- 1  SCOPE  3.1.6 

Elapsed  time  218  392 

CPU  time  208  385 

CPU  util.  US'?’  9  87; 

In  spite  of  problems  like  this,  processor-oriented  objective 
functions  can  be  both  interesting  and  meaningful. 
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system.  These  functions  fail  to  account  for  two  important 
factors:  different  programs  are  of  x^arying  value  (or  importance) 

and  of  varying  length. 

The  first  of  these  deficiencies  may  be  overcome  by  defining 
an  objective  function  that  considers  the  loss  functions  of 
programs . 

Definition:  The  loss  function  of  a  program  indicates  how  the 

value  of  the  information  provided  by  the  program,  and  thus 
the  value  of  having  the  computation  completed,  changes  with 
time. 

loss  functions  come  in  all  shapes  and  sizes,  and  the  precise  form 
the  function  is  of  importance  in  proving  the  optimality  of 
various  schedulina  rules.  In  general,  loss  functions  have  an 
inflection  point  which  represents  the  ’'due  date"  of  the 
information.  This  leads  to  program-oriented  objective  functions 
that  deal  with  mean  and  maximum,  lateness  and  earliness,  as  well 
as  total  loss. 

The  most  primitive  means  of  distinguishing  the  loss  functions 
of  various  programs  is  the  external  priority  parameter  frequently 
found  in  batch  shops.  Since  the  days  when  an  entire  computer  was 
dedicated  to  a  single  individual  for  time  periods  whose  frequency 
and  duration  depended  upon  political  or  economic  clout,  external 
priorities  have  been  a  factor  in  computer  system  scheduling.  It 
is  i m portant  to  distinguish  this  simplistic  loss  function 
indication  from  the  internal  priority,  which  indicates  how  queues 
are  ordered.  To  state  an  external  priority  is  to  implv  that  it 
is  a  component  of  the  objective  function.  If  it  is  the  sole 
component,  then  possibly  internal  priorities  (in  other  words,  the 
scheduling  disciplines  themselves)  should  mirror  external 
priorities.  Numerous  authors  (e.g.,  Mitrani,  Section  3.1)  mahe 
the  mistake  of  scheduling  according  to  external  priorities,  yet 
evaluating  performance  with  respect  to  a  processor-oriented 
objective  function. 

The  second  factor  mentioned  above  is  that  programs  are  of 
varying  lengths.  The  subject  of  service  time  information  will  be 
considered  in  Section  1.3.  with  the  advent  of  interactive  time- 


-6- 


sharing  systems,  objective  functions  that  consider  program  length 
have  become  particularly  popular.  Intuitively,  it  is  felt  that 
short  programs  should  not  be  delayed  while  long  programs  finish; 
a  five  minute  delay  is  more  significant  to  a  one  minute  program 
than  to  a  fifteen  minute  program. 

In  Chapter  Six  of  this  thesis,  ’’blowup  factor”  will  be 
defined  as  an  objective  function.  Briefly,  the  blowup  factor  of 
a  program  is  the  ratio  betwe<='n  the  amount  of  time  the  program 
takes  to  process  under  a  specific  system  load  and  the  amaunt  of 
tims  that  it  wou] d  require  if  it  were  the  only  program  in  ths 
system.  A  similar  measure  is  proposed  in  [Hellerraan  1969]. 

?!uch  more  could  be  said  about  objective  functions.  Their 
’’reasonableness”  (and,  of  course,  their  complexity)  can  be 
increased  by  combining  processor-oriented  and  program-oriented 
measures  into  a  single  function.  Such  considerations,  howeve r, 
are  beyond  the  scope  of  this  thesis. 

1 . 3  Service  t ime  information 

It  is  well  known  that  scheduling  rules  that  make  use  of 
information  regarding  the  service  times  of  the  competing  programs 
can  yield  significantly  better  performance,  according  to  almost 
any  objective  function,  than  rules  that  do  not.  Scheduling  rules 
may  be  partitioned  according  to  the  degree  to  which  they  utilize 
such  information,  and  the  manner  in  which  they  obtain  it. 

Definition:  The  service  time  of  a  program  indicates,  at  any 

desired  level  of  detail,  the  amount  of  time  that  the  program 

requires  to  fulfill  its  needs  and  complete  service. 

The  most  familiar  service  time  measure  is  total  central  processor 
time.  Service  time  might  also  represent  a  program’s  total 
requirement  at  all  processors  or  its  burst  time  at  a  single 
processor  (the  time  for  which  a  program  uses  a  particular 
processor  before  requesting  service  at  some  other  processor) .  A 
thorough  study  of  service  time  information  at  the  level  of 
complete  programs  is  contained  in  [Lester  1973]. 
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Certain  scheduling  rules  make  no  use  of  service  time 
information.  ?i.mcng  these  are  rules  that  schedule  accorSing  to 
the  order  of  program  arrival  (e.g.,  last-come-f irst-served  and 
f irst-come-first- served)  and  rules  that  rotate  a  processar  among 
the  competing  programs  and  utilize  a  uniform  quantum  length 
(e.g.,  round-robin  and  processor  sharing). 

Other  rules  require  exact  information  about  service  times,  at 
the  level  of  either  total  processing  times  or  processor  burst 
times.  These  rules  include  shortest  processing  time  first, 
shortest  remaining  processing  time  first,  and  service  rate 
biasing. 

Finally,  there  are  rules  that  require  probabilistic  service 
time  information;  shortest  expected  processing  time  first, 
shortest  expected  remaining  processing  time  first,  and  smallest 
rank  are  examples. 

Formal,  definitions  of  most  of  these  scheduling  rules  will  be 
given  in  later  chapters.  Three  of  them  may  be  unfamiliar.  The 

discipline  rotates  a  processor  among  the 
competing  programs  with  an  infinitesimally  short  quantum  length. 
Service  rate  biasijnq  gives  a  program  priority  at  a  processor  in 
inverse  proportion  to  the  amount  the  program  uses  that  processor; 
the  greater  the  use,  the  lower  the  priority.  The  rank  of  a 
program  that  has  been  scheduled  for  a  certain  quantum  is  defined 
as  the  ratio  of  expected  processing  time  investment  to  t-he 
probability  of  program  completion;  the  smallest  rank  discipline 
([  Sevcik  1972a])  schedules  the  program  of  smallest  rank  for  the 
indicated  quantum. 

I 

While  scheduling  rules  that  make  use  of  service  time 
information  can  provide  significantly  improved  performance,  it  is 
clear  that  scheduling  on  the  basis  of  erroneous  service  time 
info.rmation  can  produce  results  at  least  as  bad  as  those  obtained 
using  a  no-information  rule.  To  what  extent,  by  what  means,  and 
on  what  time  scale  is  it  possible  to  obtain  accurate  service  time 
information  for  the  central  processor  and  other  processors? 

For  a  number  of  reasons,  that  is  a  difficult  question  to 

It  has  yet  to  be  demonstrated,  for  example,  that  users 
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are  capable  of  explicitly  providing  accurate  information  about 
the  total  central  processor  requirements  of  their  programs.  The 
"time  limit"  parameter  found  on  most  batch  job  cards  represents 
an  upper  bound  rather  than  an  estimate  and  these,  of  course,  are 
clearly  separate  functions.  Additionally,  the  usual  one-minute 
default  option  provides  an  umbrella  under  which  a  large  number  of 
programs  can  hide.  Fven  when  job  card  space  is  reserved  for 
coding  an  estimate,  adequate  incentives  for  accuracy  are  seldom 
provided.  This  is  not  unreasonable,  since  most  contemporary 
schedulers  would  not  know  what  to  do  with  accurate  service  time 
information  if  it  were  available.  [Metzger  1972]  proposes  the 
following  incentive  scheme:  charge  =  cost  of  resources  used  + 
squares  of  errors  in  service  time  estimates. 

Recently  [Lester  1973]  studied  the  University  of  Toronto 
Computer  Centre  batch  load  to  find  indirect  conditioning 
parameters  that  would  decrease  the  coefficient  of  variation  in 
the  central  processor  service  time  distributions  of  programs. 

She  met  with  some  success,  particularly  with  two- parameter 
conditioning  that  included  memory  requirement  as  a  predictor  of 
service  time. 

Obtaining  accurate  total  service  time  information  in 
interactive  systems  may  present  even  more  of  a  problem.  While  it 
is  possible  to  empirically  profile  the  service  time  requirements 
of  the  users  at  a  particular  installation  (cf.  [Scherr  1967]),  it 
is  obviously  unreasonable  to  require  explicit  service  tine 
information  along  with  every  command.  Conditioning  upon  command 
type  might  provide  a  useful  handle  on  this  problem. 

In  the  preceding  discussion  a  distinction  has  been  implied 
between  explicit  and  implicit  service  time  information.  While 
the  former  is  provided  by  the  user,  the  latter  is  either  derived 
from,  other  explicit  iaformation  or  extrapolated  from  observed 
program  behavior.  The  derivation  of  implicit  service  time 
information  requires  making  various  assumptions  about  this 
behavior. 

An  example  of  a  simple  and  successful  scheduling  rule  that 
uses  implicit  service  time  information  is  the  multiple-level 
feedback  scheme  employed  in  many  time-sharing  systems  (e.g.. 
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and  Hultics;  TScherr  1967^),  This  rule  assumes  that  central 
processor  demands  are  distributed  with  a  low  mean,  a  long  tail, 
and  a  median  much  less  than  the  mean.  Incoming  requests  are 
immediately  given  a  short  quantum  of  service.  If  a  program  has 
failed  to  complete  its  service  requirement  at  the  end  of  a 
quantum,  it  is  relegated  to  the  next  lower  level  of  the 
scheduling  hierarchy.  Programs  at  each  level  receive  a  larger 
quantum  of  service  than  those  at  higher  levels,  but  are  only 
served  if  no  higher-level  programs  are  present. 

A±  a  much  lower  level,  one  would  like  to  obtain  accurate 
estimates  of  burst  time  intervals  for  each  program  at  each 
processor.  This  is  certainly  not  the  sort  of  information  that 
users  could  even  indirectly  provide.  In  many  cases,  however, 
detailed  records  of  program  burst  time  activity  are  maintained 
for  accounting  purposes.  To  what  extent  do  programs  exhibit 
burst  time  locality  that  would  permit  historical  information  to 
be  used  to  predict  future  behavior? 

[Sherman  1972a3  (also  reported  in  [Sherman  1972b])  discusses 
experiments  performed  on  a  trace-driven  simulator  of  the  UT-1 
system  for  the  CPC  6600  that  compare  the  performance  of  shortest 
burst  time  first  central  processor  scheduling  using  exact  burst 
time  knowledge  to  the  performance  of  the  same  rule  when  various 
methods  of  CPU  burst  time  estimation  are  employed.  The  objective 
function  was  number  of  programs  completed  in  a  certain  time 
interval . 

Sherman’s  study  verified  that  individual  programs  exhibit 
considerable  central  processor  burst  time  locality;  in  other 
words,  that  there  is  a  strong  serial  correlation  of  burst  times. 
In  particular,  he  found  that  a  schedule  based  upon  exponentially 
biased  averages  of  previous  burst  times  came  reasonably  close  to 
approximating  the  performance  of  a  schedule  that  used  exact  burst 
time  information.  The  schedules  evaluated,  in  decreasing  order 
of  performance,  were: 

1.  A-Ctual  shortest  burst  time  first  (theoretically  optimal); 

2.  Exponential  smoothing  of  previously  observed  burst  times  as 

an  approximation  to  (1) : 

3.  Round-robin; 
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4.  Complete  program  burst  time  history  as  a  predictor  of  burst 
time ; 

5.  Fandom  guess  at  burst  time; 

6.  Theoretically  optimal  non-preemptive  schedule; 

7.  First-come-first- served . 

This  study  provides  convincing  evidence  that  it  may  be  possible 
to  dynamically  and  inexpensively  derive  accurate  and  useful  burst 
time  information. 

1 .  4  Optimal  scheduling  rules 

In  the  extremely  unlikely  event  that  a  computer  system 
manager  were  able  to  settle  upon  a  well-defined  objective 
function  for  his  installation,  it  would  still  be  unwise  to 
attempt  to  schedule  the  system  in  the  precisely  optimal  manner. 

In  all  probability,  the  parameters  upon  which  the  objective 
function  and  scheduling  rule  were  based  would  not  be  able  to  be 
monitored  with  total  accuracy,  and  even  if  this  were  possible, 
the  values  of  these  parameters  would  probably  change  so  rapidly 
that  maintaining  the  optimal  schedule  would  be  a  full-time  job. 
Because  of  these  considerations,  factors  such  as  the  robustness 
of  scheduling  rules  (the  degree  to  which  their  perf orman ce  is 
insensitive  to  errors  or  variations  in  scheduling  data) ,  their 
efficiency  or  simplicity,  and  their  ’’nearness”  to  theoretical 
optimality  are  of  primary  concern  in  actual  computer  systems. 

Nonetheless,  a  fair  amount  of  effort  has  been  devoted  to 
demonstrating  the  optimality  of  various  scheduling  rules  in 
certain  restricted  environments.  This  work  is  important  for  at 
least  two  reasons:  it  may  provide  a  clue  to  finding  good  rules 
for  more  general  environments,  and  it  furnishes  a  bound  against 
which  the  performance  of  actual  systems  may  be  evaluated. 

With  very  few  exceptions,  formal  optimality  results  deal  with 
single-resource  systems.  [  Schrage  1974]  provides  an  excellent 
overview  of  this  work.  The  more  important  results  are  the 
following : 

A.mong  rules  that  require  exact  service  time  information, 
scheduling  shortest  processing  time  first  (SPT)  is  optimal  with 
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respsct  to  minimizing  total  loss  if  preemption  is  disallDwed, 
program  arrivals  are  Poisson,  service  times  are  distributed 
generally,  and  all  programs  have  the  same  linear  loss  functiDn. 
When  preemption  is  free,  fSchrage  1968]  has  shown  that  scheduling 
shortest  remaining  processing  time  first  (SPPT)  is  optimal  under 
general  arrivals  with  respect  to  the  same  objective  function. 

The  proofs  of  the  above  statements  utilize  "interchange 
arguments":  it  is  shown,  for  example,  that  the  performance  of  a 

non-SPT  schedule  is  improved  by  interchanging  the  order  of 
programs  so  that  the  schedule  becomes  SPT.  This  type  of  argument 
does  not  generalize  to  systems  with  more  than  one  scarce 
resource. 

Among  rules  that  require  probabilistic  service  time 
information,  f Sevcik  1972a]  has  defined  the  smallest  rank  (SR) 
rule,  briefly  described  in  Section  1.3,  and  has  proven  it  optimal 
within  a  class  of  disciplines  at  minimizing  the  expected  rate  of 
total  loss,,un<3er  non- identical  linear  losses,  Poisson  program 
arrivals,  and  free  preemption.  SR  specializes  in  restricted 
environments  to  simpler  rules  that  have  been  proven  optimal;  for 
example,  to  shortest  expected  processing  time  first  (SEPT)  with 
Poisson  arrivals,  identical  linear  losses  and  no  preemption. 

Recently  [Price  1972]  has  considered  optimality  in  a  system 
with  two  resource  types:  a  single  central  processor  at  which 
queueing  occurs,  and  a  sufficient  number  of  identical  peripheral 
processors  so  that  all  requests  for  peripherals  may  be  granted 
immediately.  Price  shows  that  scheduling  the  central  processor 
SRPT  is  optimal  with  respect  to  both  central  processor 
utilization  and  peripheral  utilization. 

Price’s  results,  along  with  the  clear  dominance  of  SRPT-based 
rules  in  single-resource  systems,  indicate  that  it  might  be 
profitable  to  experiment  with  this  sort  of  discipline  in 
multiple-resource  systems  where  queueing  occurs  at  all 
processors.  This  provides  the  motivation  behind  the  present 
work . 
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2  •  approaches 

While  the  complexity  of  multiple-resource  computer  systems 
has  caused  a  paucity  of  theoretical  results,  there  is  an 
abundance  of  largely  obvious  heuristic  guidelines  to  the 
effective  scheduling  of  such  systems.  The  first  four  sections  of 
this  chapter  review  these  principles;  Section  2.5  discusses  their 
application  in  actual  computer  systems. 

2.  1  The  balanced  set  principle 

Two  fundamental  notions  proposed  by  [ Codd  1960]  and  'Denning 
1968],  "feasible"  program  mixes  and  "balanced  sets"  of  programs, 
lie  behind  most  of  the  heuristic  approaches  to  scheduling 
multi  pie- resource  computer  systems. 

Codd  points  out  that  computer  systems  are  both  time-  and 
space-shared.  While  a  time-shared  resource  may  be  over¬ 
committed,  in  the  sense  that  the  demands  of  the  active  programs 
are  greater  than  the  capacity  of  the  resource,  "real"  space- 
shared  resources  have  a  fixed  capacity  which  cannot  be  exceeded. 
Codd  defines  a  feasible  program  mix  to  be  a  set  of  programs  the 
sum  of  whose  space  requirements  is  less  than  the  available  total 
for  each  resource. 

It  is  desirable  to  select  for  multiprogramming  a  feasible  set 
of  programs  whose  total  requirements  come  as  close  as  possible  to 
equalling  the  capacity  of  each  space-shared  resource.  Codd 
presents  program  selection  and  placement  rules  that  strive  to 
meet  this  objective.  A  deficiency  of  the  scheme,  of  course,  is 
that  time-shared,  resources  (e.g.,  the  central  processor)  are 
ignored .  Furthermore ,  on  time- shared  resources  the  best  f eas ible 
mix  according  to  the  above  criterion  might  not  yield  highest 
total  utilization  because  of  dynamic  program  interactions,  which 
are  not  taken  into  account. 

Denning’s  theme  is  essentially  the  same,  but  he  formalizes 
resource  allocation  among  a  fixed  number  of  programs  with  known 
characteristics  as  dsomorphic  to  the  multidimensional  knapsack 
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problem.  Since  this  problem  is  NP-complete,  there  is  presently 
no  non-combinatorial  algorithm  that  provides  an  optimal  solution 
([Karp  1972]).  The  realities  of  program  arrivals  and  prngram 
characteristics  that  vary  and  cannot  be  precisely  measured  do  not 
contribute  to  the  ease  of  solution. 

Denning  considers  both  time-  and  space-shared  resources,  and 
informally  defines  a  balanced  set  of  programs  to  be  a  set  that 
dynamically  utilizes  all  resources  well.  Denning’s  major  concern 
is  with  the  interaction  of  virtual  memory  allocation  and  central 
processor  allocation.  His  working  set  model  for  program  memory 
reference  behavior  is  a  predictive  tool  for  obtaining  a  balanced 
set  of  programs. 

A  more  formal  notion  of  a  balanced  system  is  hinted  at  in 
[ Rodriguez-Rosell  1972].  He  defines  the  "operating  point"  of  a 
system  to  be  that  point  in  the  imult idimensional  resource 
utilization  space  that  corresponds  to  instantaneous  utilization 
levels.  Bounds  may  he  established  on  a  desirable  region  of  this 
space,  and  the  system  may  then  strive  to  maintain  its  operating 
point  within  this  region  by  some  means. 

2 . 2  Feedback  scheduling 

There  can  be  little  argument  that  scheduling  in  accordance 
with  the  balanced  set  principle  is  a  reasonable  objective. 
Unfortunately,  it  is  not  clear  how  to  select  programs  and 
scheduling  rules  that  are  compatible  in  the  sense  that  they 
result  in  a  system  operating  ppint  within  the  desirable  region. 

While  it  might  be  possible  to  select  a  balanced  set  of 
programs  on  the  basis  of  user-supplied  resource  demand  data,  this 
method  has  several  drawbacks.  It  addresses  too  high  a  level, 
since  users  clearly  cannot  estimate  the  burst  times  of  their 
programs  at  various  resources.  Alsb,  it  is  static,  in  that 
program  interactions  are  ignored. 

Mternatively ,  the  system  might  dynamically  collect  data  on 
program  behavior,  as  discussed  in  Section  1.3,  and  on  the  basis 
of  this  information  select  a  balanced  set  of  programs  to  be 
concurrently  processed  during  a  forthcoming  time  interval.  Under 
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a  scheme  such  as  this,  errors  in  estimates  and  unforeseen  program 
interaction  effects  could  be  compensated  for  by  a  feedback 
mechanism,  resulting  in  a  self-regulating  scheduler.  It  is 
possible  to  make  necessary  adjustments  at  either  the  macro- 
scheiuling  level,  by  altering  the  program  mix,  or  the  micro¬ 
scheduling  level,  by  varying  the  scheduling  disciplines  employed 
at  the  various  resources.  [Bunt  1974]  studies  these  techniques, 
both  of  which  have  been  tried  in  experimental  systems  (e.g., 
r  f^unck  1971])  , 

2.3  Wulti-level  schedulinq 

A  number  of  authors  have  advocated  the  separation  of  the 
scheduling  function  into  several  different  elements  distinguished 
primarily  by  time  scale:  the  frequency  at  which  scheduling 
decisions  are  re-evaluated  (equivalently,  the  time  interval  over 
which  a  scheduling  decision  is  binding).  The  arguments  advancei 
in  support  of  this  viewpoint  include,  by  analogy  to  structured 
programming,  the  use  of  scheduling  levels  as  structural  and 
conceptual  tools  ([Brinch  Hansen  1972]),  and  the  feeling  that 
many  of  the  same  considerations  that  force  deadlock  handling 
strategies  to  work  on  a  number  of  different  time  scales  (e.g., 
the  ease  with  which  a  resource  may  be  preempted  and  the 
opportunity  cost  involved  in  having  a  resource  idle)  are 
applicable  to  more  general  scheduling  problems. 

Operating  systems  have  made  at  least  implicit  use  of  multi¬ 
level  scheduling  for  a  long  while.  Generally,  memory  is 
allocated  in  advance  of,  and  for  a  longer  period  than,  the 
central  processor,  and  peripheral  devices  are  sometimes  allocated 
before  memory  (fHavender  1968]  discusses  this  in  relation  to 
IBM's  OS/360  MVT) .  While  this  distributed  nature  of  scheduling 
decisions  may  be  desirable,  certain  overall  strategies  are 
required;  if  the  decision  to  allocate  memory  to  a  program  is  made 
in  the  absence  of  information  about  the  availability  of  other 
resources,  for  example,  processor  idleness  will  likely  be  the 
result . 

It  is  possible  that  the  application  of  multi-level  scheduling 
techniques  may  help  to  resolve  some  of  the  inherent  conflicts 
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between  program-o rient:e(a-  and  processor-oriented  objective 
functions.  [Lan  19'^1]  and  [Browne  1972]  report  on  a  simulation 
study  of  the  interaction  between  memory  (or  long  term,  or  macro) 
scheduling  decisions  and  central  processor  (or  short  term,  or 
micro)  scheduling  decisions.  It  was  found  that  allocating  memory 
according  to  either  a  shortest  processing  time  first  or  smallest 
cost  first  (where  cost  =  memory  requirement  x  processing  time) 
discipline  results  in  a  program  completion  rate  more  than  twice 
as  great  as  using  the  smallest  memory  size  first  or  first-come- 
first-served  disciplines,  and  that  this  effect  tends  to  dominate 
the  variation  introduced  by  different  micro-scheduling 
disciplines. 

Several  factors  contribute  to  the  small  impact  of  micro¬ 
scheduling  in  ban's  study.  The  objective  function  may  have 
favored  macro-scheduling;  Section  6,5  will  look  at  this  in  more 
detail.  The  program  mix  used  was  heavily  CPU  bound  and  the  2DC 
6600  which  was  modelled  has  tremendous  peripheral  processor 
power,  both  of  which  tend  to  diminish  the  importance-  of  CPU 
scheduling.  Further  experiments  along  this  line  might  serve  to 
confirm  the  intuitive  feeling  that,  given  proper  coordination, 
macro-scheduling  can  have  a  dominant  effect  on  program-oriented 
objective  functions  while  micro- sched uling  plays  the  determining 
role  in  performance  according  to  processor-oriented  objective 
functions. 

2.4  Wahl's  work 

[Wahl  1970]  begins  from  the  premise  that  users  can  and  will 
accurately  forecast  the  total  demands  that  their  programs  place 
upon  various  system  resources.  He  characterizes  a  system  of  N 
programs  and  M  resources  by  a  pair  of  NxM  matrices: 

A  where  ?iv(i,i)  is  the  proportion  of  its  time  that  program 
i  is  expected  to  spend  at  resource  j;  and 

P  where  P(i,j)  is  the  unique  internal  priority  of  program 
i  at  resource  j,  and  P(i,j)  <  P(k,j)  means  that  program 
i  has  preemptive  priority  over  program  k  at  resource  j; 

and  by  two  Uxl  vectors: 

W  where  W(i)  is  the  progress  rate  of  program  i,  the  ratio 
of  its  ’’virtual"  time  to  clock  time,  0  <  VI  (i)  <  1;  and 
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C  where  C(i)  is  the  urgency,  or  external  priority,  of 
program  i. 

In  Wahl’s  scheme  a  resource  utilisation  vector  and  an 
external  priority  must  initially  be  provided  for  every  program. 
Mahl’s  scheduler  then  attempts  to  find  values  for  P,  the  matrix 
of  internal  priorities,  and  W,  the  progress  rate  vector,  such 
that  some  economic  criterion  (e.g.,  the  sum  over  all  programs  of 
the  product  of  external  priority  and  progress  rate)  is  maximised. 
The  progress  rate  values  that  the  scheduler  assigns  are 
constrained  to  those  that  can  be  met  even  under  worst-case  inter- 
program  resource  conflict. 

Like  many  other  authors,  Wahl  divides  the  scheduling  problem 
into  several  different  operations  distinguished  by  time  scale.. 
His  macro- scheduler  is  invoked  occasionally  to  compute  P  and  W; 
his  micro-scheduler  enforces  P,  A  and  W  by  giving  preemptive 
priority  at  each  particular  resource  j  to  the  program  i  with  the 
least  P(i,j),  and  by  preempting  program  i  if  it  uses  any  resource 
j  more  than  W(i)?Lfd,i)t  in  time  interval  t. 

The  philosophy  of  considering  the  demands  that  programs  place 
upon  various  resources  in  making  scheduling  decisions  is  an 
intelligent  one,  but  Wahl  fails  in  several  important  respects. 
Primarily,  it  seems  intuitively  clear  that  the  guantity  of 
interest  in  micro- schedul ing  is  not  the  percentage  of  its  time 
that  a  particular  program  spends  at  each  resource,  but  rather  the 
actual  burst  time  that  is  characteristic  of  each  program  at  each 
resource.  Mahl’s  preoccupation  with  percentages  leads  to  a  proof 
(his  Appendix  B)  that  if  only  one  resource  is  being  competed  for 
and  loss  functions  are  identical,  the  optimal  priority  assignment 
is : 

P(i,j)  <  P(k,j)  <=>  A(i,j)  <  A(k,j) 

(Recall  that  a  lower  priority  value  indicates  higher  priority.) 
But  this  is  a  case  in  which  SRPT  is  optimal  (f  Schrage  1968  ])  and 
Mahl’s  theorem  says  nothing  about  actual  program  burst  intervals. 

Mahl’s  assumption  of  maximum  in  ter- program  resource  conflict 
also  has  interesting  side  effects,  leading  to  a  proof  (his 
Appendix  A)  that  ”the  assignment  to  each  job  of  a  set  of  the  same 
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priorities  for  all  resources  is  a  wrong  choice,  which  can  always 
be  improved.”  ?ls  an  example,  Mahl  cites  a  system  of  two  programs 
and  two  resources.  Each  program  spends  half  of  its  time  at  each 
resource,  but  one  program  has  much  shorter  burst  times  than  the 
other.  This  latter  fact  is  ignored  by  Hahl’s  algorithm,  which 
assigns  top  priority  at  each  resource  to  a  different  program,  an 
apparent  contradiction  of  his  earlier  statement  that  "better 
simultaneity  in  resource  usage  and  between  jobs'  progresses  is 
achieved  by  assigning  a  high  priority  for  a  resource  to  a  job 
which  will  mak-e  little  use  of  the  resource.”  Adhering  to  this 
latter  principle  would  dictate  giving  priority  on  both  resources 
to  the  program  with  the  shorter  burst  times. 

The  guestions  that  Mahl  considers  are  interesting  and 
important  ones,  even  if  his  answers  are  frequently  less  than 
totally  satisfactory.  Section  5.4  of  this  thesis  will  show  that 
his  "relative  utilization"  arguments  have  more  basis  than 
intuition  might  grant  them. 

2.5  Experiments  in  multiple-resource  scheduling 

The  traditional  approach  to  scheduling  multiple-resource 
computer  systems  has  been  to  employ  the  first-come-f irst- served 
discipline  at  all  peripheral  processors  and  a  rule  based  upon 
external  priorities  at  the  central  processor.  Until  recently, 
most  of  the  practical  efforts  directed  at  improving  this 
situation  have  dealt  with  increasing  I/O  -  CPU  overlap,  and  have 
taken  the  form:  "I  gave  highest  central  processor  priority  to 
I/O  bound  programs,  and  look  what  it  bought  me."  Among  the 
systems  that  have  been  treated  in  this  manner  are  IBM's  3S/360 
([Marshall  1969]  and  [Pyder  197C])  and  TSS  ([Doherty  1970  ]),  and 
CDC*  s  Chippewa  ([Stevens  1968  ]  and  [Abell  1970]),  Several 
current  commercial  systems  (e.g,  IBM's  irs-2  ([Ryder  1970])) 
include  a  scheme  of  this  sort. 

The  University  of  "^exas  at  Austin's  UT-2  operating  system  for 
the  CDC  6600,  briefly  described  in  [Sherman  1972a],  is  an 
interesting  example  of  a  system  with  an  efficient,  distributed, 
multi-level  scheduler.  There  are  actually  four  autonomous 
schedulers,  each  potentially  employing  a  different  discipline. 
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The  program  (or  main  memory)  scheduler  assigns  an  available 
"control  point"  (which  carries  with  it  the  right  to  request  other 
resources)  to  the  lowest  cost  program  (where  cost  =  memory 
requirement  x  processing  time)  that  will  fit  in  available  memory. 
The  central  processor  scheduler  uses  a  modified  round-robin  rule, 
selecting  from  among  the  programs  at  control  points.  The  five 
peripheral  processing  units  are  allocated  first-come-f irst-served 
to  programs  that  request  them,  and  disks  are  allocated  round- 
robin  fashion  (with  an  infinite  quantum)  to  peripheral  processors 
that  request  them,  Lan's  simulation  study  (Section  2.3)  reveals 
the  value  of  the  main  memory  scheduling  discipline  that  is 
employed.  The  original  Control  Data  system  allocated  control 
points  non-preemptive ly  based  upon  external  priority,  without 
regard  to  memory  avai l.ability .  The  result,  the  high  degree  of 
memory  idleness  described  in  f Stevens  1968],  illustrates  the 
importance  of  coordination  between  the  various  levels  in  a 
scheduling  hierarchy. 


Much  of  the  work  on  multiple- resource  scheduling  has  arisen 
from  the  ashes  of  paged  memory  systems.  With  the  realization 
that  memory  utilization  levels  (measured  by,  say,  U  =  percentage 
of  'real  memory  blocks  referenced  per  unit  time)  were 
extraordinarily  low  in  conventional  fixed  memory  systems,  paged 
memory  hierarchies  were  introduced.  The  tradeoffs  involved  are 
very  difficult  to  evaluate;  memory  utilization  is  being  purchased 
at  the  expense  of  some  central  processor  and  channel  overhead. 
Sines  there  is  a  very  high  limit  on  the  capacity  of  a  virtual 
resource  there  is  a  tendency  to  over-commit  it.  In  the  case  of 
paged  memory  systems,  this  over-commitment  is  manifested  by 
"thrashing":  excessive  page  transfers  and  resultant  high  central 

procsssor  and  channel  overhead.  Most  schedulers  for  paged 
systems  now  attempt  to  determine  that  the  "working  set"  of  a 
program  (a  frequently  defined,  if  not  well  defined,  conospt)  will 
fit  in  available  memory  before  dispatching  the  program.  Several 
modifications  to  IBM’s  CP-6'7  that  incorporate  this  technique  have 
been  described  in  the  literature  ([ Munck  1971]  and  [Bodriquez- 
Posell  19731) ;  TSS,  VS-2  and  Multics  also  make  use  of  it. 


Some  reasonably  sop-hist icated  feedback  schedulers  that 
attempt  to  achieve  balance  by  capitalizing  on  knowledge  of 
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various  facets  of  program  behavior  in  addition  to  memory 
requirements  have  been  implemented  (cf.  [ Munck  1971]).  The 
overhead  involved  in  data  collection  and  the  limited  knowledge 
concerning  what  should  be  done  with  a  program  of  particular 
characteristics  have  impeded  progress  in  this  area,  however. 

Finally,  while  it  is  not  specifically  related  to  multiple- 
resource  systems,  a  fair  amount  of  experimentation  has  been 
devoted  to  the  problem  of  decreasing  preemption  overhead. 
[Bernstein  1971],  [Mullery  1970]  and  [  Bhat  1  973  ]  all  address  this 
problem . 


3. 


O  * 


A  thorough  treatment  of  processor  scheduling  models  of 
computer  systems  ife  unfortunately  missing  from  the  literature. 
This  chapter  will  briefly  set  out  the  fe-atures  and  assumptions 
frequently  encountered  in  models  of  computer  systems  and  will 
survey  several  of  the  more  sophisticated  models  of  multiple- 
resource  systems.  Chapter  Pour  discusses  both  general  solution 
techniques  and  the  specific  solution  for  the  model  used  in  this 
thesis.  [Coffman  1973]  is  an  excellent  introduction  to  general 
models  of  computer  systems  and  their  control  algorithms.  A 
concise  introduction  to  simple  queueing  models  and  the  required 
probability  theory  appears  in  [Hillier  1967].  Queueing  models  of 
job  shop  scheduling  problems  are  thoroughly  discussed  in  [Conway 
1967].  [McKinney  1969]  is  a  brief  but  good  overview  of 
analytical  time-sharing  models.  Buzen's  thesis  ([ Buzen  1971]) 
contains  an  excellent  survey  of  queueing  network  models  applied 
to  general  computer  systems. 

^s  the  name  suggests,  queueing  models  are  used  to  study  the 
behavior  of  waiting  lines.  Like  any  model,  whether  it  be 
physical,  simulation,  stochastic  or  mathematical,  a  queueing 
network  model  may  be  employed,  for  any  of  a  number  of  reasons. 

T^’or  example,  the  system  that  is  to  be  studied  may  not  presently 
exist,  it  may  not  be  possible  to  properly  control  the  environment 
of  the  real  system,  or  it  may  be  too  costly,  in  either  financial 
or  human  terms,  to  conduct  experiments  on  the  actual  system. 

A  model  is  generally  constructed  to  investigate  certain 
specific  behavioral  phenomena  of  a  real  system.  It  is  the 
essence  of  modelling  to  abstract  precisely  those  characteristics 
of  the  system  and  its  environment  that  are  relevant  to  the  tapics 
being  investigated  and  to  include  only  these  in  the  model. 

Crucial  to  this  task  is  an  understanding  of  the  level  of  detail 
at  which  the  system  must  be  studied. 

A.n  actual  computer  system  consists  of  a  large  number  of 
resources  (channels,  central  processor,  etc.)  which  will  be 
called  processor  for  the  purposes  of  this  discussion.  A^t  a  high 
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Simplifying  assumptions  are  made  in  models  for  one  of  two 
reasons;  to  remove  clutter  that  is  judged  to  be  irrelevant  to 
those  properties  of  the  system  that  are  being  investigated  or  to 
achie-ve  mathematical  or  computational  tractability . 
Simplifications  of  the  second  type  are  much  more  dangerous,  and 
will  he  discussed  first. 


Various  quantities  in  a  queueing  network  model  are 
represented  by  stochastic  variables.  These  include  program 
interarrival  times  and  service  times.  Service  times  may  in  turn 
be  represented  at  any  desired  level  of  detail:  total  processing 
time,  total  central  processor  time,  total  time  at  each  processor, 
or  burst  time  at  each  processor.  It  is  obviously  desirable  to 
have  the  theoretical  probability  distributions  that  are  used  to 
model  these  quantities  fit  as  closely  as  possible  the  empirically 
measured  data.  The  difficulty  of  this  is  discussed  in  [Lester 
1973]. 


rrntil  very  recently,  the  vast  majority  of  computer  system 
models  attempted,  to  represent  all  programs  by  one  set  of 
probability  distributions.  In  other  words,  it  was  assu.T\ed  that 
the  interarrival  times  and  service  times  of  all  programs  were 
each  independent,  identically  distributed  random  variables.  It 
was  also  generally  assumed  that  these  variables  were 
exponentially  distributed. 

In  any  modelling  study,  it  is  necessary  to  assess  both  the 
correctness  and  the  implications  of  such  simplifying  assumptions. 
In  [Seveik  1974]  the  three  common  assumptions  listed  abave  are 
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discussed  in  some  detail.  The  material  that  follows  is  an 
expansion  of  the  points  made  in  that  abstract. 


•  The  form,  of  service  time  distributions 
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pplication  of  a  probability  distribution  determines 
its  moments  are  of  consequence.  At  the  very  least,  the 
variance  are  usually  significant.  Informally,  the 
indicates  '’how  close"  successive  samples  are  apt  to  be 
other. 


As  difficult  as  it  may  be  to  fit  a  theoretical  probability 
distribution  to  empirical  service  time  data,  it  is  even  harder  to 
cope  mathematically  with  most  theoretical  distributions.  Because 
of  its  "memoryless"  property  the  exponential  distribution  is 
"nice"  from  the  point  of  view  of  analytic  tractabi lity ,  and  it  is 
therefore  often  assumed  in  queueing  network  models  that 
stochastic  variables  are  exponentially  distributed. 


Unfortunately,  the  exponential  distribution  bears  but  scant 
resemblance  to  most  empirically-observed  distributions.  A  number 
of  studies  (e.g.,  [Baskett  1971])  claim  that  total  CPU  and  I/O 
processing  times  have  higher  and  lower  variances  than  the 
exponential  distribution,  respectively.  Ignoring  higher  moments 
allows  these  quantities  to  be  represented  by  hyperexponential  and 
hypoexponential  distributions.  Since  both  of  these  distributions 
can  be  synthesized  by  stages  of  exponential  servers  they  can  be 
handled  without  undue  difficulty.  This  technique,  while 
important,  will  not  be  used  in  this  thesis  and  a  description  is 
therefore  omitted,  Sevcik  (f Sevcik  1972b])  and  Baskett  are  among 
the  authors  who  have  studied  hype rexponentially  distributed  CPU 
service  times. 


A.n  extensive  study  of  CPU  service  time  distributions  is 
presented  in  [Lester  1973],  She  concludes  that  a  four-parameter 
distribution  is  required  to  accurately  fit  CPU  service  time  data 
that  has  been  preconditioned  on  explicitly  provided  parameters. 

The  assumptions  made  about  the  form  of  a  distribution  relate 
to  assumptions  about  sample  independence  and  the  use  of  identical 
distributions.  As  an  example  of  this,  consider  the 
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representation  of  the  processor  burst  times  of  programs.  At  the 
most  coarse  level,  a  single  distribution  might  be  used  to 
represent  the  burst  times  of  all  programs  at  all  processors.  -  A 
more  refined  model  would  use  a  separate  distribution  to  represent 
all  burst  times  at  each  processor.  Carrying  this  refinement  one 
step  further  would  involve  representing  the  burst  times  of  each 
program  at  each,  processor  by  a  separate  distribution'. 

Even  at  this  level  of  detail  there  are  difficulties.  * 
Evidence  shows  that  the  burst  times  of  a  single  program  at  the 
central  processor  have  a  fairly  high  variance,  or  at  least  a 
variance  greater  than  that  of  an  exponential  distribution.  Dn 
the  other  hand,  the  data  in  [Sherman  1972a']  and  [Sherman  1972b] 
indicates  that  there  is  significant  correlation  between  the 
successive  CPU  burst  time  intervals  of  a  program.  In  other 
words,  while  the  burst  times  over  the  life  of  a  program  have  a 
high  variance,  a  sample  consisting  of  a  relatively  amall  HiM»ber 
of  successive  burst  intervals  has  a  low  variance.  Smecessive 
burst  times  are  not  independent  of  one  another. 

•  Independence  of  successive  samples 

As  mentioned  above,  the  correctness  of  assuming  that 
successive  values  of  a  stochastic  variable  are  independent  of  one 
another  -is  related  to  the  form  of  the  probability  distribution 
from  which  samples  are  drawn.  This  is  because  u^^sg  ^ 
distribution  with  a  low  variance  implies  a  certari-n^ 'correlation  ‘ 
between  successive  samples,  while  using  a  distribution  with  a 
high  variance  implies  total  independence. 

If  a  probability  distribution  is  used  to  represent  the  'total 
processing  times  cf  the  various  programs  in  a  system  then  it  may 
be  realistic  to  assume  that  successive  samples  are  independent. 

At  a  lower  level,  though,  the  successive  CPU  burst  intervals  of  a 
particular  program  seem  to  be  correlated.  The  success  of  the 
feeaback  schedulers  discussed  in  Chapter  Two,  (e.g.,  [Doherty 
1970],  [Marshall  1969],  [ Munck  1971],  [ Rodriguez-Rosell  1972], 
[Ryder  1970]  and  [Stevens  1968])  supports  this  contention  for  I/O 
and  paging  as  well  as  CPU  behavior. 
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If  a  low- varianc?  distribution  is  used  to  model  the  burst 
times  of  a  program,  then  selecting  successive  samples  from  the 
sair^e  distribution  may  imply  minimal  independence.  This 
approach's  weakness  is  that  it  assumes  a  low  variance  in  burst 
times  across  the  life  of  the  program. 

•  The  use  of  identical  distributions 

Programs  can  be  roughly  grouped  into  classes  based  upon  their 
service  t:ime  requirements,  although  the  parameters  upon  which  to 
a  nriori  base  such  a  partitioning  are  somewhat  in  doubt. 

Lester's  results  in  this  area  were  mentioned  in  Section  1.3. 
Scheduling  rules  that  make  use  of  explicit  or  implicit  service 
time  information  are  advantageous  according  to  almost  any 
obgective  function. 

To  assume  that  the  total  processing  times  of  all  programs  are 
identically  distributed  (i.e.,  that  they  have  the  same  form  and 
parameter  values,  or  are  selected  from  the  same  distribution)  is 
a  simplification  that  restricts  the  scheduling  problems  that  can 
be  investigated  to  those  involving  the  least  sophisticated  of 
disciplines. 

A.t  the  level  of  program  burst  times,  it  is  unrealistic  to 
assume  that  the  burst  times  of  one  program  at  all  processors  or 
o-p  all  programs  at  one  processor  are  identically  distributed.  To 
use  a  single  distribution  to  represent  the  burst  times  of  one 
program  at  one  processor  is  better,  but,  as  previously  discussed, 
is  still  not  entirely  realistic.  Perhaps  several  samples  should 
be  taken  from  one  distribution,  then  several  from  another 
distribution,  and  so  on. 

A  large  number  of  simplifying  assumptions  fall  into  the 
"feature  judged  irrelevant"  category  described  above.  Even  more 
lie  somewhere  between  the  two  assumption  types. 

^-s  more  processors  are  added  to  a  model,  analysis  becomes 
more  complex.  A.s  a  result,  most  models  claim  to  be  CPU 
scheduling  models.  When  additional  processors  are  added  and  are 
distinguished  from  one  another,  queueing  is  often  allowed  to 
occur  at  only  one  processor;  it  is  assumed  that  the  other 
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processors  are  identical  and  that  there  are  enough  units  to 
satisfy  all  requests.  A.11  program's  are  sometimes  assumed  to 
visit  all  processors  with  the  same  frequency. 

Certain  scheduling  disciplines  (specifically,  those  that 
involve  the  order  in  which  programs  request  service  at  a 
processor,  such  as  f irst-come-f irst-served  and  last-come-f irst- 
served)  are  more  difficult  to  model  than  others.  This  will  be 
discussed  in  Section  4.2.  Sometimes  it  is  assumed  that  the  same 
scheduling  discipline  will  be  used  at  all  processors.  Finally, 
preemption  overhead  is  usually  ignored. 

These  simplifying  assumptions  are  not  a  £riori  ’’good"  or 
"bad.  "  In  every  case,  the  net  gain  or  loss  depends  upon  the 
system  being  modelled  and  the  level  of  detail  at  which  that 
system  is  being  studied.  It  is  useful,  however,  to  classify 
various  models  on  the  basis  of  these  assumptions,  as  follows: 

•  How  many  processors  are  modelled? 

•  Do  queues  form  at  all  processors? 

•  Can  processors  be  independently  scheduled? 

•  What  forms  of  distributions  are  used? 

•  A.re  successive  service  times  independent? 

•  Do  all  programs  have  the  same  characteristics? 

•  Do  they  visit  the  same  processors? 

•  Are  their  service  times  identical  at  some  processors? 

•  Which  service  disciplines  can  be  represented? 

•  Is  preemption  overhead  accounted  for? 

The  level  of  detail  at  which  a  system  is  viewed  forms  twa 
more  criteria  by  which  to  differentiate  queueing  models: 

•  What  objective  functions  are  studied? 

•  Are  arrivals  and  departures  considered  (an  open  model) ,  or  do 
programs  circulate  indefinitely  (a  closed  model)? 

This  thesis  is  aimed  at  studying  the  low-level  behavior  of 
computer  systems  with  more  than  one  scarce  resource.  In  the 
following  sections,  those  models  in  the  literature  that  are  most 
relevant  to  this  subject  will  be  analyzed.  Both  the  modelling 
techniques  and  the  results  obtained  are  of  interest. 


r Mitrani  1972a]  models  a  system  of  two  processors  that  allows 
a  fixed  degree  of  multiprogramming  among  programs  with  different 
service  time  characteristics.  All  service  times  are  assumed  to 
be  exponentially  distributed.  The  goal, of  his  investigation  is 
to  determine  under  what  conditions  a  preemptive  priority 
scheduling  discipline  is  advantageous  at  the  I/O  processor, 
assuming  that  the  I/O  transfer  is  resumed  rather  than  restarted 
after  a  preemption. 


In  a  closed  or  cyclic  model,  programs  neither  enter  nor  leave 
the  system.  Rather,  they  circulate  among  the  processors.  In 
Mitrani's  configuration,  shown  in  Figure  3.1,  a  program  that  has 
completed  service  at  the  CPU  (PO)  may  do  one  of  two  things.  With 
probability  q (program  identifier)  it  requests  service  at  the 
peripheral  processor  (PI)  .  With  probability  1  -  q(proaram 
identifier)  it  completes  service  and  is  immediately  replaced  by 
another  program  of  identical  characteristics. 
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Rather  than  experimenting  with  different  program 
characteristics  and  priority  assignments  and  then  measuring 
performance  against  various  objective  functions,  Mitrani  attempts 
to  answer  questions  of  the  following  sort:  If  the  highest 
priority  program  in  the  system  is  I/O  bound,  is  a  preemptive  or  a 
non- preemptive  discipline  at  the  I/O  processor  preferable  with 
respect  to  CPU  utilization?  He  thus  holds  external  priorities  to 


Figure  3 . 1 
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be  inviolable  in  scheduling  the  processors,  but  ignores  them  in 
formulating  his  objective  function. 


Mitrani’s  experiments  with  three  active  programs  lei  him  to  a 
number  of  observations  about  the  behavior  of  his  model: 


1.  With  preemptive  priority  practiced  at  both  processors  and 
assuming  identical  peripheral  processor  service  times  for  all 
programs,  the  "conventional  enlightened”  policy  of  giving 
highest  priority  to  those  programs  that  have  the  lowest  CPU 
service  times  (i.e.,  the  I/O  bound  programs)  results  in 
considerably  lower  CPU  utilization  than  certain  other 
policies ; 

2.  If  all  programs  have  the  same  service  time  characteristics, 
CPU  utilization  is  independent  of  scheduling  discipline; 

3.  If  low-priority  programs  are  heavily  I/O  bound,  CPU 
utilization  is  highest  when  preemptive  priority  is  practiced 
at  the  I/O  processor.  When  high-priority  programs  are 
heavily  I/O  bound,  a  non-preemptive  priority  discipline  is 
preferable. 


Witrani's  observations  are  intuitively  consistent  with  his 
assumptions.  With  regard  to  the  first  observation,  note  that 
since  a  program  has  the  same  priority  at  each  processor,  giving 
highest  priority  at  PO  to  those  programs  that  utilize  it  the 
least  results  in  giving  highest  priority  at  PI  to  those  programs 
that  utilize  it  relatively  more  heavily,  resulting  in  excessive 
gueueing  at  this  processor.  Mahl  (Section  2.4)  considered  this 
topic;  the  results  presented  in  Section  5.4  are  also  directed 
towards  the  concept  of  relative  processor  utilization. 


Observation  three  also  makes  intuitive  sense;  in  the  first 
case,  high-priority  programs  may  be  forced  to  gueue  for  a  long 
period  at  the  I/O  processor  if  non-preemptive  priority  is 
practiced  there. 

3.2  Buzen's  central  server  model 

Like  Mitrani,  [Buzen  1971]  considers  closed  models  of 
mult iprogrammed  systems.  Buzen’s  analysis  is  capable  of  handling 
multiple  peripheral  processors,  but  he  cannot  deal  with  programs 
of  different  characteristics.  The  name  "central  server"  derives 
from  the  fact  that  there  is  a  distinguished  processor,  meant  to 
represent  the  CPU,  to  which  programs  return  after  completing 
service  at  any  peripheral  processor. 
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Buzen's  model,  shown  in  Figure  3.2,  includes  one  central 
processor  and  an  arbitrary  number,  n,  of  peripheral  processors. 
When  a  program  completes  its  (exponentially  distributed)  service 
at  the  CPU  (PO) ,  it  either  terminates  with  probability  g(0)  and 
is  immediately  replaced  by  an  identical  program,  or  goes  to  one 
of  the  n  peripheral  processors  with  probability  q  (i)  ,  i  =  (1, 

...,  n) .  Obviously  the  sum  of  the  probabilities  q(0)  through 
q  (n)  must  equal  one. 

Service  times  at  the  peripheral  processors  are  also  presumed 
to  be  exponentially  distributed,  with  a  different  mean  at  each 
processor.  The  objective  function  is  mean  program  completion 
rate,  which  is  equivalent  to  central  processor  utilization  since 
programs  are  indistinguishable. 

Buzen  uses  his  model  to  analyze  three  sample  problems: 
optimal  buffer  size,  in  which  he  finds  that  slower  devices 
require  larger  buffers;  balancing  peripheral  processor 
utilization,  in  which  he  finds  that  under  optimal  balancing, 
faster  devices  will  have  longer  average  queue  lengths  than  slower 
devices;  and  page  traffic  balancing,  in  which  he  finds  that  a 
good  page  replacement  strategy  is  of  more  relative  benefit  with  a 
fast  paging  processor  than  with  a  slow  one. 

The  inability  to  distinguish  between  programs  limits  the 
usefulness  of  Buzen's  model  for  investigating  novel  scheduling 
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disciplines.  The  central  server  concept,  though,  represents  an 
important  advance  in  the  direction  of  more  realistic  queueing 
models. 


3.3  Baskett’s  sequence  of  queueing;  network  models 


[ Baskett  1971]  develops  a  series  of  queueing  network  models, 
many  of  the  central  server  type  and  each  a  more  accurate 
approximation  of  actual  multiprogramming  systems  than  the  last. 
In  particular,  some  of  his  models  include  other  than 
exponentially  distributed  service  times. 


Like  Buzen,  Baskett  does  not  attempt  to  distinguish  between 
programs.  Central  processor  utilization  is  his  objective 
function.  In  the  course  of  formulating  his  models,  Baskett  makes 
the  following  interesting  claims: 


1.  Improvement  of  processor  utilization  is  the  primary  goal  of 
multiprogramminq.  The  degree  to  which  this  improvement  is 
realized  depends  upon  (at  least)  the  following  factors: 

a.  degree  of  multiprogramming, 

b.  number  and  types  of  I/O  processors, 

c.  lengths  of  CPU  and  I/O  bursts, 

d.  regularity  and  distributions  of  these  burst  periods, 

e.  degree  to  which  a  program  can  overlap  its  own  I/O  and 
CPU  processing, 

f.  degree  of  program  similarity  or  dissimilarity, 

g.  preemption  overhead  for  each  processor; 

2.  Empirical  studies  show  that  the  dispersion  or  variance  of  I/O 
service  times  is  often  much  less  than  that  of  an  exponential 
distribution.  In  selecting  a  more  appropriate  distribution 
to  model  I/O  requests,  it  is  desirable  to  use  an  exponential¬ 
like  distribution  in  order  to  maintain  tractability.  The 

h y poexponential  distribution  is  therefore  the  most 
appropriate  one; 

3.  CPU  service  time  variance  is  often  greater  than  that  of  the 
exponential  distribution.  By  the  above  argument,  the 
hyperexponential  distribution  seems  most  appropriate  to  model 
these  service  times. 


The  points  under  the  first  statement  indicate  the  extent  of  the 
factors  that  must  be  considered  in  constructing  a  realistic  model 
that  will  accurately  answer  questions  about  general 
multiprogramming  systems.  Observations  two  and  three  are 
directed  at  the  simplifying  assumptions  discussed  in  the 
introduction  to  this  chapter. 


The  various  features  incorporated  into  Baskett' s  models  are 
listed  below.  In  general,  as  one  constraint  is  relaxed  another 
is  tightened: 
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1.  models  have  a  fixed  number  of  orograms  with  service  time 
characteristics  and  required  processors  selected  from  the 
same  distributions; 

2.  A.11  models  have  a  single  central  processor; 

3.  Either  there  are  as  many  identical  peripheral  processors  as 
there  are  active  programs,  in  which  case  no  queueing  occurs 
except  at  the  central  processor,  or  there  is  a  fixed,  limited 
number  of  peripheral  processors; 

4.  Peripheral  processors  are  either  indistinguishable  (a  common 
queue  is  used  for  all  peripherals,-  or  there  is  no  queueing  at 
all),  or  they  each  have  their  own  queue; 

5.  The  service  time  distribution  at  each  peripheral  processor  is 
either  exponential  or  hypoexponential ; 

6.  The  CPU  service  time  distribution  is  either  exponential  or 
h  y perexponential ; 

7.  The  central  processor  scheduling  discipline  is  either  first- 
come-f irst-served  or  processor  sharing.  Uhen  peripheral 
processor  queueina  is  allowed,  first-come-f irst- served  is 
always  practiced. ' 


Baskett  makes  two  observations  about  the  performance  of  his 
models  that  relate  to  the  simplifying  assumptions  about  service 
times : 


1.  For  a  particular,  not  totally  realistic  model,  the  difference 
in  CPU  utilization  with  hypoexponentially  distributed  T/3 
service  times  and  with  exponentially  distributed  I/O  service 
times  was  nealigible.  If  this  could  be  shown  to  be  generally 
true,  then  a' subst ant ial  simplification  would  be  possible, 

2.  For  another  not  totally  realistic  model,  a  hyperexponential 
distribution  of  CPU  service  times  along  with  processor 
sharing  resulted  in  the  same  CPU  utilization  as  exponential 
service  times  with  first-come-f irst-served  scheduling. 
(Utilization  under  FCPS  with  hyperexponential  service  times 
was  much  lower.)  This  provides  a  useful  bound  on  the 
improvement  to  be  gained  by  round-robin  scheduling,  of  which 
processor  sharing  is  the  limiting  case. 


The  treatment  of  non-exponential  service  time  distributions 
represents  an  important  innovation.  Certain  other  interesting 
problems  (e.g.,  distinguishable  programs  and  additional  service 
disciplines)  are  not  addressed.  In  the  paper  discussed  in  the 
next  section,  Baskett  extends  his  work  to  encompass  some  other 
conventional  scheduling  disciplines. 


3 .  The  general  model  of  Muntz  and  Baskett 


The  extensions  to  computer  system  models  provided  by  this 
work  ([Muntz  1972  1)  allow  open,  closed  and  mixed  networks  of 
queues  with  several  distinguishable  programs  and  multiple 
scheduling  disciplines.  In  addition  to  formalizing  very  well  the 
notion  of  what  constitutes  a  state  under  various  scheduling 
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rules,  the  paper  applies  a  fairly  novel  solution  technique, 
Chan3y*s  "local  balance"  concept,  to  enable  complex  models  to  be 
analyzed  in  a  reasonable  amount  of  time. 

The  goal  of  this  work  is  the  accurate  representation  of  some 
specific  computer  system  rather  than  general  experimentation. 

The  following  processor  types  and  scheduling  disciplines  may 
appear  in  their  model; 


1 . 


2. 


3. 


4. 


A  single  processor  scheduled  according  to  the  first-come- 
first-served  rule,  where  all  programs  have  the  same 
exponentially  distributed  service  time  with  mean  a  function 
of  the  number  of  programs  in  the  queue; 

A  single  processor  scheduled  according  to  the  processor 
sharing  discipline*  where  each  program  may  have  a  unique 
service  time  distribution  of  any  form  which  possesses  a 
rational  Laplace  transform  (this  includes  most 
distributions) ; 


A  number  of  processors  in  excess  of  the  number  of  program 
so  that  no  gueueina  occurs.  Each  program  at  a  processor 
this  type  may  have'a  unique  service  time  distribution  of 
form  with  a  rational  Laplace  transform. 


o^ 

any 


A  single  processor  scheduled  preemptive  last-come-f irst- 
served,  where  each  program  may  have  a  unique  service  time 
distribution , of  any  form  with  a  rational  Laplace  transform. 


A  model  may  include  an  arbitrary  number  of  distinguishable 
program  classes.  The  programs  pass  from  processor  to  processor 
and  change  classes  according  to  transition  probabilities.  In  the 
case  of  an  open  program  class,  the  arrival  rate  is  Poisson  and 
the  mean  can  depend  upon  one  of  several  factors. 


It  is  shown  that  the  ability  of  a  program  to  change  class  and 
processor  probabilistically  is  powerful  enough  so  that  any  system 
that  can  be  represented  by  the  full  model  can  be  reasonably 
approximated  by  a  transformed  representation  in  which  all  program 
classes  have  the  same  exponentially  distributed  service  times  at 
a  particular  processor;  in  other  words,  each  processor  has  one 
exponentially  distributed  service  time  associated  with  it.  This 
considerably  simplifies  the  solution  of  the  model. 

The  first  step  in  creating  this  new  representation  involves 
the  introduction  of  new  program  classes.  A  program  which  has  a 
non-exponential  service  time  distribution  at  a  processor  may  now 
be  "simulated"  by  having  that  program  change  classes  in  the 
course  of  its  service  at  a  single  exponential  processor.  Once 
all  service  time  distributions  have  been  made  exponential  by  this 
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transformation,  transition  probabilities  are  modified  so  that 
each  program,  no  matter  what  class  it  may  belong  to,  has  the  same 
mean  service  time  at  each  processor. 


While  the  four  sc 
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3.5  Anderson’s  model  with  N  distinguishable  tasks 

From  the  point  of  view  of  understanding  program  interactions 
and  the  behavior  of  scheduling  rules  in  systems  with  more  than 
one  scarce  resource,  A^nderson’s  work  ([Anderson  1973  ])  is  the 
most  thought-provoking  thus  far  encountered. 

Anderson’s  model  is  a  relatively  simple  one.  It  is  a  closed 
central  server  model  consisting  of  a  central  processor  and  a 
fixed  number  of  peripheral  processors.  A  fixed  number  of  program 
classes,  each  containing  one  program,  are  multiprogrammed .  Each 
program  class  has  its  own  exponential  service  time  distributions 
at  the  central  processor  and  at  each  peripheral  processor,  and 
additionally  has  its  own  set  of  peripheral  processor  selection 
probabilities. 

Anderson  justifies  the  use  of  exponential  service  tine 
distributions  by  claiming  that  empirical  evidence  has  established 
only  that  the  aggregate  burst  times  of  all  programs  in  a  system 
are  not  exponentially  distributed.  He  contends  that  this  results 
from  the  differing  requirements  of  the  various  programs,  and  that 
the  burst  times  of  a  particular  program,  which  his  service  times 
are  meant  to  represent,  might  still  be  exponentially  distributed. 
(This  same  assumption  will  be  made  in  the  model  used  in  this 
thesis . ) 

The  primary  thrust  of  the  paper  is  two-fold:  to  demonstrate 
that  allowing  programs  to  have  different  characteristics  can 
affect  processor  utilization  (which  is  fairly  obvious),  and  to 
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argas  that  processor  sharing  closely  approximates  preemptive 
priority  (which,  especially  in  view  of  the  fact  that  preemptive 
priority  is  never  defined,  is  highly  suspect  (cf.  Chapter  Five)), 
The  experimental  technique  that  Anderson  employs,  however,  shows 
great  promise. 

Anderson  performs  a  series  of  experiments  on  a  limited 
version  of  his  general  model.  He  treats  a  system  with  a  CPU, 
three  peripheral  processors,  and  three  program  classes.  He 
assumes  that  all  programs  have  the  same  mean  service  time  at  a 
particular  peripheral  processor.  Using  central  processor 
utilization  as  his  objective  function,  Anderson  attempts  to 
assess  the  impact  of  the  following  factors: 


1.  Service  disciplines: 

a.  preemptive  priority, 

b.  processor  sharing; 


2,  Peripheral  processor  contention: 

a.  no  contention  (unique  processor  selection), 

b.  equiprobable  peripheral  selection, 

c.  non-uniform  selection  functions; 


3.  I/O  boundedness: 

a.  all  programs  equivalent  and  rouahly  balanced, 

b.  one  program  CPU  bound,  two  T/0  bound, 

c.  one  I/O" bound,  one  balanced,  one  CPU  bound; 

h.  Peripheral  processor  service  rates: 

a.  all  processors  equal, 

b.  one  twice  the  speed  of  the  others, 

c.  one  half  the  speed  of  the  others. 


Only  the  processor  sharing  discipline  was  treated  with 
queueing  network  model;  a  simulator  was  used  to  obtain 
the  preemptive  priority  rule. 


the 

data  for 


Because  of  the  large  number  of  factors  (e.g.,  processors  and 
program  classes)  that  Anderson  considers,  conclusions  are 
relatively  difficult  to  draw  from  his  data.  For  the  most  part 
the  same  utilization  effects  were  felt  by  both  service 
disciplines  for  particular  variations  in  program  characteristics. 
Anderson's  approach  is  most  interesting,  and  seems  more  important 
than  the  particular  results  he  derives. 
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3.6  Omahe^'s  multiple -resource  aueues 


^ach  of  the  previous  models  has  dealt  with  the  micro¬ 
scheduling  of  systems  with  more  than  one  processor.  Wherever  a 
queue  is  allowed  to  form  it  is  for  a  single  processor,  and  the 
goal  is  to  find  a  set  of  scheduling  rules,  one  for  each 
processor,  that  maximizes  some  objective  function, 

Omahen's  model  ([Omahen  1973];  surveyed  in  [Omahen  1972], 
which  presents  the  results  without  the  mathematics)  differs  in 
two  respects.  First,  it  is  a  macro-scheduling  model:  the  two 
resource  types  are  meant  to  represent  central  processors  and 
units  of  memory.  Second,  a  program  must  queue  for  both  resources 
simultaneously. 


Dmahen  models  a  system  composed  of  two  central  processors  and 
two  units  of  memory.  There  are  two  program  classes.  Programs 
belonging  to  the  first  class  require  a  single  CPU  and  one  unit  of 
memory.  Programs  of  class  two  require  a  single  CPU  and  two  units 
of  memory.  Punning  a  class  two  program  results  in  an  idle  CPU; 
also,  running  a  class  one  program  if  the  only  other  available 
programs  belong  to  class  two  will  result  in  both  an  idle  CPU  and 
an  idle  memory  unit. 

It  is  clear  that  system  capacity  is  a  function  of  the 
scheduling  rule  in  a  multiple-resource  queue  as  defined  by 
Omahen.  This  is  not  the  case  with  queues  for  single  resources. 
One  important  question  to  ask  in  considering  scheduling  rules  for 
multiple- resource  queues  is  which  rules  are  full  capacity,  in  the 
sense  that  they  saturate  only  when  all  other  disciplines 
saturate? 

Omahen  examines  five  scheduling  rules;  f irst-come-f irst- 
served;  non-pre^mpti ve  priority  to  class  one  programs;  preemptive 
priority  to  either  class  one  or  class  two;  and  conditional 
preemptive  priority  to  class  one.  This  last  rule  favors  class 
one  if  and  only  if  there  are  at  least  two  class  one  programs 
available. 


Expected 

disciplines. 

conditional 


waiting  times  and  flow  times 
It  is  shown  that  preemptive 
priority  to  class  one  are  the 


are  derived  for  all 
class  two  priority  and 
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discipl  ines.  This  latter  scheduling  rule  hints  at  the  rather 
obvious  point  that  in  scheduling  multiple-resource  systems, 
priorities  (from  the  point  of  view  of  total  system  utilization) 
must  be  assigned  to  certain  combinations  of  programs  that 
together  possess  characteristics  that  utilize  the  entire  system 
well,  rather  than  to  individual  programs.  In  the  extension  of 
this  work  to  a  k-way  memory  partitioning  and  k  job  classes  the 
need  for  such  considerations  becomes  even  clearer,  ' 

While  this  analysis  technique  is  directed  at  higher-level 
scheduling  rules  than  will  be  considered  in  the  remainder  of  this 
thesis,  the  concept  of  queueing  simultaneously  for  more  than  one 
resource  is  an  interesting  one. 
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^ •  S Ystems  with  two  processors  an^  two  programs 

The  goal  of  this  and  the  following  chapters  is  to  provida  an 
orderly  analytic  study  of  systems  with  two  processors  (which  may 
be  thought  of  as  a  CPU  and  an  I/O  channel)  and  two  active 
programs.  No  claim  is  made  that  this  model  is  entirely  realistic 
or  that  the  results  are  immediately  applicable  to  actual  computer 
systems.  Bather,  it  is  hoped  that  a  solid  understanding  of  the 
interaction  effects  that  arise  in  this  relatively  simple  system 
will  furnish  a  basis  for  the  formal  and  intuitive  comprehension 
of  more  complex  environments. 

A:fter  a  general  discussion  of  the  model  and  solution 
techniques,  the  performance  of  various  scheduling  rules  will  be 
examined  with  an  eye  towards  finding  a  rule  that  consistently 
outperforms  others.  In  particular,  an  attempt  will  be  made  to 
evaluate  the  obvious  extensions  of  optimal  single-processor 
scheduling  rules.  Chapter  Five  considers  processor-oriented 
objective  functions;  program-oriented  objective  functions  are 
discussed  in  Chapter  Six. 

Scheduling  decisions  need  he  made  only  in  conflict 
situations.  In  a  general  system,  conflict  occurs  when  two  or 
more  programs  are  simultaneously  requesting  service  from  one 
processor.  A  scheduling  rule  is  not  specified  for  the  system  as 
a  whole;  rather,  one  is  specified  for  each  processor.  Df  course, 
the  same  rule  may  be  employed  everywhere  if  desired. 

In  this  analysis,  two  families  of  scheduling  rules  will  be 
considered.  The  first  consists  of  the  "order"  rules,  and 
includes  f ir st-come-f irst-served  (FCFS)  and  last-come-f irst- 
served  (LCFS)  .  Note  that  in  a  system  with  only  two  programs,  any 
n on-preemptive  priority  discipline  is  equivalent  to  FCFS  and  is 
therefore  a  member  of  this  family. 

The  second  family  contains  the  "preemptive  priority"  rules. 
Included  in  this  family  are  such  familiar  rules  as  full  priority 
to  one  or  the  other  class,  processor  sharing,  service  rate 
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biasing  (priority  in  inverse  proportion  to  demand),  shortest 
burst  time  first,  shortest  total  processing  time  first,  etc, 

4 . 1  The  model 

The  queueing  network  model  used  in  this  thesis  is  an 
extremely  simple  one  similar  to  several  of  those  discussed  in  the 
previous  chapter.  The  model  is  very  close  in  spirit  to 
Anderson’s  (Section  3,5). 

A  closed  system  with  two  processors  is  considered,  as  shown 
in  Figure  4.1.  Two  programs  are  always  active  in  the  system,  one 
from  each  of  two  program  classes.  Each  program  class  has  its  own 
service  time  distribution  at  each  of  the  two  processors.  The 
service  times,  which  are  exponentially  distributed,  are  meant  to 
represent  burst  times. 

Definition:  The  burst  time  of  a  program  class  at  a  processor  is 
the  time  for  which  a  program  of  that  class  will  use  the 
processor  before  completing  and  requesting  service  at  another 
processor. 

Note  that  burst  time  has  no  relation  to  either  quantum  length  (a 
parameter  defined  by  the  operating  system)  or  total  processing 
t  ime. 

Conceptually,  each  program  may  be  thought  of  as  cycling 
between  the  two  processors  a  number  of  times,  then  terminating 
and  being  immediately  replaced  by  another  program  of  the  same 
program  class.  The  phrases  ’’burst  time”  and  ’’burst  rate” 
correspond  to  the  phrases  ’’service  time”  and  ’’service  rate”  in 
the  models  discussed  in  Chapter  Three.  The  distinction  is  made 
here  in  order  to  avoid  confusion  in  future  chapters. 

Initially,  the  number  of  cycles  required  by  each  program 
class  will  be  of  no  concern;  later,  the  objective  function  will 
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be  changed  to  one  that  takes  this  factor  into  account  and  the 
model  will  be  modified  accordingly.  F.undamentally,  however,  this 
thesis  and  the  model  deal  with  micro-scheduling  and  short-term 
orocessor  allocation  rather  than  with  the  macro-scheduling  of 
multiple- resource  systems  as  considered  by  Omahen, 

Let  us  attempt  to  analyze  the  features  and  limitations  of 
this  model  in  the  context  of  those  discussed  in  the  previous 
chapter.  The  model  is  realistic  in  at  least  the  following 
respects: 

1.  More  than  one  processor  is  modelled. 

2.  There  is  a  limited  number  (namely  1)  of  each  processor  type. 

3.  Programs  are  distinguishable  from  one  another.  More 
importantly,  each  program  class  has  unique  characteristics  at 
each  processor. 

4.  The  processors  are  scheduled  independently,  and  a  wide 
variety  of  scheduling  rules  may  be  employed. 

While  models  with  more  than  one  processor  are  common  in  the 
literature,  some  of  the  other  features  listed  above  are 
frequently  omitted,  ^s  discussed  in  the  introduction  to  Chapter 
Three,  two- processor  systems  are  often  modelled  by  assuming  that 
queueing  occurs  at  only  one  processor;  the  second  processor  is 
composed  of  as  many  servers  as  there  are  programs.  Some  models 
allow  only  one  class  of  programs;  others  assume  that  all  programs 
have  the  same  service  time  on  some  processors.  Often  the  same 
(absolute-priority)  schedule  must  be  used  at  all  processors. 

Three  assumptions  of  the  model  deserve  to  be  called  into 
question : 

1.  The  infinite-source  nature  of  the  model  implies  that  when  a 
proaram  completes  it  is  immediately  replaced  by  one  of 
identical  characteristics  and  that  arrivals  and  external 
queue  lengths  may  be  neglected, 

2.  Burst  times  are  presumed  to  be  exponentially  distributed. 

3.  Preemption  overhead  is  neglected. 

The  first  criticism  may  be  sidestepped  by  claiming  that  it  is 
short-term  or  micro  scheduling  policies  that  are  being 
investigated,  and  that  program  replacement  is  therefore  not  of 
interest.  Confronting  the  issue  squarely  would  require  a  defense 
of  the  claims  that  service  time  and  burst  time  characteristics  of 
programs  may  be  accurately  estimated  and  that  certain  "types”  of 
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programs  have  similar  characteristics.  These  issues  were 
discussed  in  Section  1.3,  in  regard  to  [Lester  1973]  and  [Sherman 
1972a ]. 

The  question  of  how  burst  times  are  distributed  is  a  more 
difficult  one.  It  was  discussed  in  the  introduction  to  Chapter 
Three.  The  exponential  assumption  is  often  made  and  often 
criticized  with  respect  to  total  processing  time,  and 
considerable  effort  has  gone  into  evaluating  both  the  accuracy 
and  the  effect  of  the  assumption. 

Accepting  the  concensus  that  total  processing  times  are  not 
exponentially  distributed,  two  defenses  are  available:  claim 
that  burst  times  might  still  be  exponentially  distributed 
(Anderson's  approach);  or  point  to  Baskett's  result  that  for 
certain  restricted  types  of  systems  the  effect  of  non-exponei tial 
service  times  seems  to  be  fairly  small. 

Preemption  overhead  is  clearly  not  negligible,  and  there  is 
no  nice  way  to  account  for  it.  However,  the  analyses  that  follow 
will  show  that  the  scheduling  rules  that  call  for  a  ridiculous 
number  of  preemptions  (e.g.,  processor  sharing)  are  not  optimal 
even  when  the  cost  of  those  preemptions  is  ignored,  and  that 
while  the  performance  of  the  rules  that  use  no  preemption  (e.g., 
FCFS)  is  not  exceptionally  poor,  the  exponential  assumption  works 
in  favor  of  these  rules. 

Several  other  shortcomings  of  this  analysis  (the  fact  that 
only  two  processors  and  two  active  programs  are  treated)  are 
deficiencies  of  the  system  being  analyzed  rather  than  of  the 
model  of  that  system.  The  model  generalizes  easily  to 
accommodate  additional  program  classes  containing  one  program 
each,  and  classes  with  identical  characteristics  may  be  used  to 
model  the  concurrent  execution  of  similar  and  dissimilar 
programs.  Additional  processors  may  also  be  added  to  the  model. 
Anderson  (Section  3.5)  makes  use  of  both  of  these  extensions, 
which  result  in  complex  model  behavior  for  which  intuitive 
justification  is  difficult  to  find.  The  thrust  of  this  thesis  is 
not  to  present  a  model,  but  rather  to  gain  insight  from  the  use 
of  that  model. 


-40~ 


U  ,  2  State-transition  regrese^ntat  ions 


The  types  of  models  dealt  with  in  this  thesis  are  analyzed  by 
solving  the  steady-state  balance  equations  of  the  state- 
transition  representations  of  the  models.  The  precise  definition 
of  a  state,  and  thus  the  exact  state-transition  representation 
used,  depends  not  only  upon  the  characteristics  of  the  model 
(e.g.,  number  of  processors,  number  of  distinct  program  classes) 
but  upon  the  scheduling  disciplines  that  will  be  considered. 


Definition:  Each  program  class  has 

processor.  This  burst  rate  is 
Note  that  since  the  unit  of  rat 
for  a  program  class  may  not  be 
on  all  processors  and  dividing 
rather,  it  is  necessary  to  inve 
burst  times,  sum  these  burst  ti 
processors,  and  invert  once  aga 


the  inverse  of 
e  is  time~i,  a 
obtained  by  su 
by  the  number 
rt  all  of  the 
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Notation:  Let  u(i,j)  be  the  burst  rate  of  program  class  j  on 

processor  i.  Processors  will  be  numbered  beginning  with  zero 
and  program  classes  beginning  with  one,  and  the  term  "device” 
will  be  used  interchangeably  with  the  term  "processor".  A 
system  with  two  processors  and  two  program  classes  has  the 
following  service  rate  matrix  U: 


Class  1 
Dev.  0  1  uTT?,Tr' 
Dev.  1  1  u  ( 1 ,  1) 


Class  2 

“uiruT^r 

u  (1r2) 


Notation:  For  the  sake'^'of  economy,  a  system  with  two  processors 

and  two  program  classes  will  be  referred  to  as  a  2P22  system. 

Notation:  In  the  general  state-transition  representation  of  a 

2P2C  system,  states  will  be  designated  by  the  pair  (i,j), 
which  indicates  that  the  program  belonging  to  class  one  is  at 
processor  i  and  the  program  belonging  to  class  two  is  at 
processor  j.  If  a  scheduling  rule  of  the  order  family  is 
employed,  it  is  necessary  to  indicate  which  program  arrived 
first  when  the  programs  of  both  classes  are  at  the  same 
orocessor.  ?.r.  underscore  will  be  used  for  this  purpose;  the 
pair  (i,i)  indicates  that  both  program;^  are  at  processor 
and  that  the  class  two  program  arrived  first. 


1 
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Definition:  A  service  Erobabilit^  is  defined  for  every 

combination  of  system  state  and  program  class.  This  service 
probability  indicates  the  probability  that  the  program 
belonging  to  a  particular  class  is  receiving  service  when  the 
system  is  in  a  particular  state.  For  those  states  where  no 
conflict  exists  (i.e,,  (0,1)  and  (1,0)  in  a  2P2C  system),  all 

service  probabilities  are  unity.  In  general,  the  sum  of  the 
service  probabilities  for  a  particular  state  must  equal  the 
number  of  non-idle  processors,  and  (a  sufficient  condition) 
the  sum  of  the  service  probabilities  for  the  program  classes 
at  each  non-idle  processor  in  each  state  must  be  unity. 

Notation:  The  service  probability  is  indicated  by  q  (<st ate>, j) , 

which  denotes  the  probability  that  the  program  belonging  to 
class  j  is  receiving  service  when  the  system  is  in  the 
indicated  state.  The  identity  of  the  processor  at  which  the 
class  j  program  is  requesting  service  is  implicit  in  the 
state.  For  rotational  simplicity,  the  service  probabilities 
in  states  where  there  are  no  conflicts  (e.g.,  q((0,1),1)) 
will  be  omitted.  ?^.dd  it  ionally,  in  any  two-program  system 
g(s,2)  =  1  -  g  (s,  1)  for  s  =  (i,i).  To  reduce  the  number  of 
variables  this  form  will  be  used. 

With  this  as  background,  it  is  possible  to  construct  the 
state-transition  representation  for  a  2P2C  system  scheduled 
according  to  an  order  rule.  Each  arc  in  the  transition  diagram 
represents  the  rate  of  flow  from  one  state  to  another,  and  is 
labelled  accordingly.  Only  first-order  transitions  (those  that 
involve  the  movement  of  a  single  program)  are  considered;  higher- 
order  transitions  (e.g.,  from  state  (0,1)  to  state  (1,0))  are 
eliminated.  Strong  mathematical  justification  exists  for 
ignoring  these  higher-order  transitions  ([Conway  1967]).  Certain 
first-order  transitions  are  missing  from  the  diagram  (Figure  4.2) 
because  of  logical  restrictions;  for  example,  there  is  no 
transition  from  state  (0,1)  to  state  (0,0)  since  when  the  program 
belonging  to  claas  two  completes  at  processor  one  it.  will  not  be 
the  most  recent  arrival  at  processor  zero. 

It  is  the  service  probabilities  that  determine  the  precise 
schedule  being  evaluated.  For  FCFS,  g((0,0),1)  =  1,  q((0,0),1)  = 
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'^igure  U ,  2 


^C0>2) _ /  .  u(0,l)q((0,0),l) 
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T  snd  q((1r1.),1)  =  0.  These  four  values  imply 

respectively  that  q((C,C),2)  =  0,  g((0,0),2)  =  1,  g((I,1),2)  =  0 
and  g((1fl)r2)  =  1,  For  LCFs  scheduling,  tlie  values  are 
complemented. 

Definition:  The  ’’significant"  service  probabilities  (a  minimal 

set  from  which  all  others  may  be  computed)  will  be  called  the 

scheduling  E§ra.n>eb§rs  and  referred  to  as 

A  little  thought  will  show  that  scheduling  rules  of  the 
preemptive  priority  family  may  be  handled  as  a  snecial  case  of 
this  state  transition  representation:  by  ignoring  the  order  of 
arrival.  In  other  words,  if  g((0,0),1)  =  q((0,0),1)  and 
q((l/1)»1)  =  qf(1,I),1)  (and,  implicitly,  if  q((0,0),2)  = 

q(((5rO),2)  and  q({lr1)r2)  =  q((1rl)r2)  )  then  varying  the  two 
scheluling  parameters  over  the  range  (0,1)  will  specify  all 
scheduling  rules  of  the  second  family. 

The  state  transition  representation  can  be  simplified, 
however,  by  coalescing  states  (C,0)  and  (0,0)  into  a  single  new 
state,  (0,0)  ,  and  states  (1,1)  and  (1,1)  into  (1,1)  .  The 
resultant  representation  is  shown  in  h-igure  4.3. 

^ ^  egua t ion  solution 

It  is  necessary  to  solve  the  previous  state-transition 
representations  for  their  steady-state  probabilities  in  terms  of 
the  rates  TT  and  th^^  scheduling  parameters  0.  Assume  that  the 
system  has  reached  eauilibrium.  Then  the  rate  of  flow  into  each 
state  must  equal  the  rate  of  flow  out  of  each  state.  In  the  case 
of  preemptive  priority  scheduling  (Figure  4,3),  this  results  in 
■’^our  linear  equations  in  the  four  unknowns  (the  four  steady-state 
probabilities)  and  the  other  variables. 

As  one  example,  consider  the  balance  equation  for  state 

("r2l  : 

Pr[  (C,C)  1  (  u(0,1)q((0,0),1)  +  u(0,2)  (1-q((0,0)  ,1))  )  = 

^rr  (0,1)  1  u(1,2)  +  Prr  (1,0)  ]  u(1,1) 

"irst,  the  bad  news.  The  equations  derived  in  this  manner  are 
linearly  dependerd-,  Fext,  the  good  news.  An  additional 
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Figure  4,3 

equ=ition,  which  may  be  substituted  for  any  one  of  the  original 
four,  follows  from  the  fact  that  the  steady-state  probabilities 
must  sum  to  unity. 

The  balance  equations  may  be  solved  either  numerically,  for 
specific  cases,  or  analytically.  Both  techniques  will  be 
utilized  in  the  remainder  of  this  thesis.  Processor- orie nted 
objective  functions  may  be  calculated  directly  from  the  steady- 
s-*-ate  probabilities.  Psing  these  probabilities  in  conjunction 
with  burst  rates  and  service  probabilities  allows  the  computation 
OT^  program-oriented  objective  functions. 
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^  ob  jective  function 

h  number  of  previous  analyses  have  considered  central 
processor  utilization  as  the  objective  function.  This  is  a  f 
from  meaningless  measure,  but  it  seems  reasonable  to  consider 
processors  in  the  system  to  be  of  at  least  some  value.  In  th 
chapter  each  of  the  two  processors  will  be  presumed  to  be  egu 
valuable  (or  equally  scarce)  and  total  utilization,  the 
unweighted  average  of  the  utilizations  of  the  two  processors, 
will  be  used  as  the  objective  function.  An  argument  could  al 
be  made  for  considering  weighted  utilizations. 

T^efinition:  Total  222222§2£  Jibilization  in  a  2P2C  system  is 

unweighted  average  of  the  steady-state  utilizations  of  th 
two  processors. 

The  total  utilization  measure  has  certain  characteristics 
regardless  of  the  scheduling  rules  employed.  For  example,  if 
both  program  classes  have  identical  burst  time  characteristic 
then  the  scheduling  rules  can  have  no  effect  on  it.  This  is 
to  the  memoryless  property  of  the  exponential  distribution,  w 
implies  that  a  program  that  is  preempted  will,  when  restarted 
have  expected  characteristics  no  different  than  when  it  first 
arrived  at  the  processor.  Obviously  this  is  a  shortcoming  of 
exponential  assumption. 

5 . 1  f CFS  versus  the  grio rity  disciplines 

First-come-f irst-served  is  a  scheduling  rule  that  require 
information  concerning  burst  time  distributions  and  does  not 
use  of  preemption.  Because  it  is  both  simple  and  efficient  ( 
an  overhead  sense)  it  is  frequently  used  in  actual  computer 
sy ste  ms. 

Definition:  The  first -come- first-served  (FCFS)  scheduling  ru 

assigns  the  processor  to  the  first  program  that  requests 
and  allows  that  program  to  complete  its  service  without 
interruption. 
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As  an  unintelligent  rule,  FCFS  should  perform  poorly  with 
respect  to  total  processor  utilization.  In  this  section  it  will 
be  compared  numerically  to  processor  sharing,  a  ’’typical”  rule  of 
the  priority  family. 

Processor  sharing  is  a  theoretically  interesting  but 
practically  useless  scheduling  rule. 

Definition:  The  Rrocessor  sharing  (PS)  discipline  assigns  egual 

shares  of  a  processor  to  each  program  requesting  service  at 
that  processor. 

PS  is  the  limiting  case  of  round-robin  scheduling  as  the  quantum 
length  is  reduced  to  zero,  and  an  implementation  that  clasely 
approximates  the  theoretical  rule  would  require  an  unreasonable 
number  of  preemptions.  When  preemption  costs  are  low  enough, 
though,  it  is  sometimes  possible  to  approximate  PS  sufficiently 
closely  with  reasonable  efficiency. 

While  PS  does  not  make  explicit  use  of  burst  time 
information,  the  rapid  switching  of  the  processor  from,  program  to 
program  prevents  a  program  with  a  short  service  requirement  from 
’’getting  stuck”  behind  a  longer  program,  as  happens  under  FCFS. 
This  feature  is  advantageous  when  the  variance  in  burst  times  is 
high.  The  performance  of  PS  has  been  shown  to  be  ’’good”  in  many 
situations  (always  assuming  zero  preemption  cost)  ,  a  judgement 
that  will  be  confirmed  by  the  results  in  this  chapter.  In  the 
model,  PS  is  represented  by  setting  each  service  probability  to 
0.5  in  the  two  conflict  situations. 

The  numerical  comparison  approach  will  be  used  frequently  in 
the  remainder  of  this  thesis.  While  it  lacks  formality,  it  often 
gives  a  much  better  intuitive  grasp  of  the  relationships  between 
various  rules  than  do  sets  of  inequalities  that  have  been 
obtained  analytically.  A  ’’complete”  set  of  test  data  is  used, 
a rri ved  at  in  the  following  manner: 

First,  it  is  possible  to  take  advantage  of  the  processor 
symmetry  inherent  in  the  model.  This  means  that  a  trial  using 
the  rate  set: 
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Class_1  Class  2 
Pgv.  Oja  n 

Dev,  1  I  c  a 


is  equivalent  to  one  using  the  rate  set: 


Class_1 _ Class  2 

Dev.  0  I  c  "S. 

Dev.  1  I  a  b 


Second,  an  identical  symmetry  exists  with  respect  to  program 
classes;  the  following  rate  set: 


Class_2 _ Class  2 

Dev.  Ola  b 

Dev.  1  1  c  d 


is  equivalent  to: 


Class__1 _ i!lass_2 

Dev.  0  1  *■  b  ”■  a 

Dev,  1  I  d  c 


Taking  advantage  of  these  symmetries,  a  "factorial” 
experiment  of  twenty-one  trials  was  constructed,  using  from  one 
to  four  different  rates.  One  trial  is  devoted  to  the  case  of 
identical  rates;  five  to  the  tvro-rate  case;  nine  to  the  three- 
rate  case;  and  six  to  the  situation  in  which  there  are  four 
different  rates.  The  particular  choice  of  rate  values  (factors 
of  100)  was  found  to  clearly  reflect  phenomena  that  are  also 
present  with  other  values. 
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BLOWUPS  are  a  program-oriented  objective  function.  The 
blowup  factor  of  a  program  class  indicates  the  degree  to  which 
the  cycle  time  of  a  program  belonging  to  that  class  is  expanded, 
by  a  particular  combination  of  scheduling  rule  and  competition 
from  other  programs,  over  its  cycle  time  if  it  were  the  only 
program  in  the  system.  A  blowup  factor  of  one  indicates  no 
expansion;  a  blowup  factor  of  four  means  that  the  cycle  time  is 
quadrupled;  values  in  excess  of  1000  are  not  displayed.  Blowup 
factors  will  be  considered  in  the  chapter  that  follows. 
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TRIAL 

NO.  1 

RULE  1 

utilizations  -  % 

1  BLOWUPS 

RATE 

SET ; 

Oil 

JIEV^O  QEV.l  TOTAL 

1  CL.  1  CL.  2 

F  F 

66.66  66.66  66.66 

1.5  1.5 

DEV.O 

1  1 

1 

F  P 

P  F 

66.66  66.66  66.66 

66.66  66.66  66.66 

1.5  1.5 

1.5  1.5 

DEV.  1 

1  1 

1 

P  P 

66.66  66.66  66.66 

1.5  1.5 

TRIAL  NO.  2 

RULE  I 

1  UTILIZATIONS  -  %  1 

BLOWUPS 

rate  SET: 

F  F  40.39  79.99  60.19 

_ CLx_2 

a. 5  1.5 

CLx_2 

F  P 

33.99  83.24  58,62 

2.0  1.5 

DEV. 01  100 

1 

P  F 

33.99  99.33  66.66 

1.5  1.5 

DEV. 11  1 

1 

P  P 

33.99  99.33  66.66 

1.5  1.5 

TRIAL 

NO.  3 

RULE  1 

utilizations  -  %  1 

BLOWUPS 

RATE 

SET: 

_0-I-  J-UE  Li.i,Q£^JL.IOIAL,m 
F  F  0.99  99,99  50.49 

_ 

2.0  2.0 

— _ 

F  P 

0,99  99.99  50,49 

2,0  2.0 

DEV.O 

1  100  100 

P  F 

0.99  99,99  50.49 

2.0  2,0 

DEV.l 

1  1  1 

P  P 

0,99  99.99  50.49 

2.0  2.0 

trial 

NO,  4 

RULE  1 

utilizations  -  %  1 

BLOWUPS 

RATE 

SET: 

-.0_l_iJ2Fi^0. 
F  F  50.49 

.ll£ixi_iaiAL_J_ax_l _ £Lx_£ 

50.49  50.49  100.0  1.0 

F  P 

60.23 

60.23  60,23 

2.5  1.5 

DEV.O 

1  100 

1 

P  F 

60,23 

60.23  60.23 

2.5  1.5 

DEV.l 

1  100 

1 

P  P 

66 . 66 

66,66  66.66 

1.5  1.5 

TRIAL  NO.  5 
RATE  SET: 

— ax_i_. 

^QLjl-Z 

RULE 

F  F 

F  P 

1  utilizations  -  %  1 

66.66  66.66  66.66 

98,53  50,73  74,63 

BLOWUPS 

_ 

1.5  1.5 
2.0  1.0 

DEV.O 1 

100 

1 

P  F 

50.73  98,53  74.63 

1.0 

2.0 

DEV.l 1 

1 

100 

P  P 

98,07  98.07  98,07 

1.0 

1.0 

trial 

NO,  6 

RULE  1 

1  UTILIZATIONS  -  %  1 

BLOWUPS 

RATE 

SET: 

0  1  1 

1  DEV.O 

DEV.l 

TOTAL  1 

CL.  1 

CL.2 

F  F 

1.97 

99,49 

50,73 

51.0 

1.0 

— _ 

F  P 

33.99 

99.33 

66,66 

1.5 

1,5 

DEV.O 

1  100 

100 

P  F 

2,28 

99.33 

50.81 

38,5 

1.0 

DEV.l 

1  100 

1 

P  P 

33.99 

99.33 

66 . 66 

1.5 

1.5 

Table  5 .  1 
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TRIAL  NO. 


RULE 


UTILIZATIONS  -  %  I 


BLOWUPS 


nH  1  c-  3C.  »  • 

F 

.J<_ 

F 

-i- 

u&y  ■  V. 

66.88 

66,23 

66.55 

ViLt 

1.5 

-vL.a.  .fa 

1.5 

F 

P 

99.98 

49.76 

74.87 

1.0 

2.0 

DEV. 01  1 

100 

P 

F 

50.97 

98.04 

74.51 

2.0 

1.0 

DEV. 11  10000 

1 

P 

P 

99.98 

98.03 

99.00 

1.0 

1.0 

TRIAL  NO.  8 

RULE 

1 

UTILIZATIONS  -  % 

1 

BLOWUPS 

RATE  SET: 

P 

\ 

1 

aEV..O 

QEV.l 

TOTAL 

1 

CL.  1 

CL.2 

F 

F 

50.49 

50.00 

50,25 

1.0 

198.0 

F 

P 

99.01 

50,00 

74.51 

1.0 

2.0 

DEV. 01  1 

100 

P 

F 

66.88 

33.77 

50.33 

1.5 

3.0 

DEV. 11  1 

10000 

P 

P 

99.33 

33,99 

66 . 66 

1.5 

1.5 

TRIAL  NO.  9 

PULE 

1 

UTILIZATIONS  -  % 

1 

BLOWUPS 

RATE  SET: 

0 

1 

1 

D.EV^a, 

DEV..1. 

TOT^L 

! 

CL.  1 

CL.  2 

F 

F 

0.50 

99.99 

50.25 

2.0 

2.0 

F 

P 

0.50 

99.99 

50.24 

2.0 

2.0 

DEV. 01  10000 

100 

P 

F 

0.50 

99.99 

50.25 

2.0 

2.0 

DEV. 11  1 

1 

P 

P 

0,50 

99.99 

50.25 

2.0 

2.0 

trial  NO.  10 

RULE 

1 

utilizations  -  * 

1 

BLOWUPS 

RATE  SET: 

Q 

1 

1 

QEV.0 

QEV...L 

IQIAL 

1 

CL.  1 

£Lx-2 

F 

F 

99.50 

0.99 

50.25 

1.0 

100.5 

-jLLx-a 

F 

P 

99.98 

0.99 

50,49 

1.0 

51.0 

DEV. 01  1 

100 

P 

F 

99.74 

0.99 

50.37 

2.0 

2.0 

DEV. 11  100 

10000 

P 

P 

99.99 

0.99 

50.49 

2.0 

2.0 

TRIAL  NO.  11 

RULE 

1 

utilizations  -  % 

1 

BLOWUPS 

RATE  SET: 

0 

.L 

JQiL)U!L 

J:oIAL_I^^JI _ CLx«2 

F 

F 

1  .98 

99.00 

50,49 

1,0 

100.0 

_-CLLx-2 

F 

P 

98,07 

98.07 

98,07 

1.0 

1.0 

DEV. 01  100 

100 

P 

F 

2.91 

98.07 

50.49 

1.0 

51.0 

DEV. 11  1 

10000 

P 

P 

98.07 

98.07 

98,07 

1.0 

1.0 

TRIAL  NO.  12 

RULE 

1 

utilizations  -  % 

1 

BLOWUPS 

RATE  SET: 

9 

1 

aEv^.a 

DEV.l 

.TOTAL 

1 

F 

F 

0.99 

99,99 

50.49 

50.5 

1.0 

— a^i_. 

-CLLx-2 

F 

P 

33.34 

99,66 

66.50 

1.5 

1.5 

DEV. 01  100 

10000 

P 

F 

0.99 

99.99 

50,49 

51,0 

1.0 

DEV. 11  100 

1 

P 

P 

33,33 

99,99 

66 . 66 

1.5 

1.5 

Table  5 , 1 


2 


-50- 


TRIAL  NO.  13 

RULE  1 

1  utilizations  -  %  1 

BLOWUPS 

rate  SET: 

0 

^1 _ CL.._2 

F  F 

99.01  0.99  50,00 

1.0 

F  P 

99,01  1.48  50,25 

1.0 

100.0 

DEV.OI  1 

10000 

P  F 

99.34  33.77  66.56 

1.5 

1.5 

DEV. 11  100 

10000 

P  P 

99,33  33.99  66.66 

1.5 

1.5 

TRIAL  NO,  14 


RULE  I  utilizations  -  *  I 


BLOWUPS 


RATE 

SET: 

0  1  1  DFV.O  DFV.l  TOTAL  1  CL.  1  CL.  2 

F  F 

99.98 

1.00 

50,49  1,0 

99.0 

F  P 

99.98 

1.00 

50.49  1.0 

99.5 

DEV. 

0 1  1 

1  0000 

P  F 

99.02 

98.04 

98.53  1.0 

1.0 

DEV. 

11  10000 

100 

P  P 

99,98 

98,03 

99.00  1,0 

1.0 

TRIAL  NO,  15 

RULE  1 

UTILIZATIONS  -  %  1 

BLOWUPS 

RATE  SET: 

0  1  1  DEV.O  DEV.l  TOTAL  1  CL 

— I, .  CL  >  i2 

F  F 

0,01  99.99  50.00 

1.0 

100.0 

F  P 

0.50  99.99  50,25 

2.0 

2.0 

DEV.OI  10000 

10000 

P  F 

0,01  99.99  50.00 

1.0 

99.5 

DEV. 11  1 

100 

P  P 

0.50  99.99  50.25 

2.0 

2.0 

TRIAL  NO.  16  RULE  I 

RATE  SET: 

F  F 

— _ F  P 

DEV.OI  1  100  P  F 

DEV.l I  10000  1000000  P  P 


utilizations  -  %  I  BLOWUPS 


QEV..0 

DEV.l 

TOTAL  1 

cl:  1 

CL.2 

99.99 

0,00 

50,00 

1.0 

101,0 

99.99 

0.00 

50.00 

1,0 

100.0 

99,99 

0.00 

50,00 

2.0 

2.0 

99,99 

0,00 

50.00 

2.0 

2.0 

TRIAL  NO,  17 

RULE  1 

1  utilizations  -  %  1 

BLOWUPS 

RATE  SET: 

0 1  1 

l QEV^0 

CIEV..1  . 

.TOTAL.  J. 

CL.  1 

CL._2 

F  F 

1,00 

99.98 

50,49 

1.0 

100,0 

— CLx_i_ 

F  P 

99,98 

99,00 

99,49 

1.0 

1.0 

DEV.OI  10000 

100 

P  F 

1.00 

99,98 

50,49 

1.0 

100.0 

DEV. 11  1 

1000000 

P  P 

99,98 

99,98 

99.98 

1.0 

1.0 

TRIAL  NO.  18 

RULE  1 

UTILIZATIONS  -  % 

1  BLOWUPS 

RATE  SET: 

0  I  I 

aEv^o 

DEV.l 

TOIAL. 

1  CL.  1 

CL.2 

F  F 

0.99 

99.00 

50,00 

1.0 

_ CLx-2 

F  P  • 

98.05 

99.00 

98.53 

1,0 

1.0 

DEV.OI  100  10000 

P  F 

1  ,96 

98.05 

50,00 

1.0 

101.0 

DEV, 11  1  1000000 

P  P 

98.07 

98,07 

98.07 

1.0 

1.0 

Table  5 . 1 
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TRIAL  NO,  19 

RULE  1 

1  utilizations  -  %  1 

BLOWUPS 

rate  SET: 

0  1  J 

1  DEV.O 

DEV.l 

TOTAL  1 

CL.  1 

CL.  2 

F  F 

99.99 

0.00 

50.00 

100.0 

1.0 

— _ 

F  P 

99.99 

0.00 

50.00 

100.0 

1.0 

DEV. 01  100  1 

P  F 

99.99 

0.49 

50.24 

2.0 

2.0 

DEV. 11  10000  1000000 

P  P 

99.99 

0.49 

50.24 

2.0 

2.0 

TRIAL  NO 
RATE  SET 

.  20 

• 

• 

RULE 
£)  L. 

1  UTILIZATIONS  -  %  1 
l DFV.0  DEV.l  TOTAL  1 

k- 

BLOWUPS 
CL^1 CL^2 

F 

F 

99.99 

0.99 

50.49 

99.0 

1.0 

£Lx-J _ CLx-2 

F 

P 

99.99 

0.99 

50.49 

100.0 

1.0 

DEV.O 1 

10000  1 

P 

F 

99.02 

98.03 

98.53 

1.0 

1.0 

DEV.l 1 

100  1000000 

P 

P 

99.99 

98.03 

99,01 

1.0 

1.0 

TRIAL  NO.  21 

RULE 

1  UTILIZATIONS  -  %  1 

BLOWUPS 

RATE  SET: 

.  i-JlE  L-TQTAL-1^ 

F  F 

0.99  99.01  50,00 

1.0 

_ 

F  P 

0.99  99,50  50,25 

2,0 

2.0 

DEV.OI  100  1000000 

P  F 

0.99  99.98  50.49 

1.0 

100.0 

DEV, 11  1  10000 

P  P 

0.99  99.99  50,49 

2,0 

2.0 

Table 

5.1-4 

Table  5. 1  contains  much  more  information  than  will  ba 
discussed  immediately.  A  few  general  comments  are  in  order, 
however. 

FCFS  practiced  at  both  processors  is  consistently  inferior  to 
PS  at  both  processors  (except,  of  course,  when  the  classes  have 
identical  burst  rate  characteristics  (trials  1  and  3))  ,  and  is 
generally  inferior  to  the  other  two  discipline  combinations  as 
well. 

The  degree  of  improvement  that  PS  af  doth  processors  provides 
over  the  other  combinations  is  greatest  when  the  program  classes 
are  of  a  "complementary”  nature  (i.e.,  the  programs  of  each  class 
spend  the  majority  of  their  time  at  opposite  processors,  and  the 
total  cycle  times  for  the  two  classes  are  approximately  the  same; 
trials  5  and.  14).  When  both  program  classes  have  the  same  rate 
at  a  processor,  then  either  FCFS  or  PS  may  be  used  at  that 
processor  without  affecting  total  utilization  when  PS  is  employed 
at  the  unegual-rate  processor  (trials  2,  6,  9,  11  and  15). 
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There  are  two  cases  (trials  8  and  18)  in  which  practicing 
FCFS  at  one  processor  and  PS  at  the  other  yields  higher  total 
utilization  than  does  PS  at  both  processors.  No  intuitive 
explanation  comes  to  mind.  In  both  cases  one  program  class  has  a 
higher  rate  at  each  processor  than  does  the  other  class,  but  this 
is  also  true  in  trials  U,  10,  13,  16  and  21.  In  neither  case  do 
the  classes  have  "similar"  characteristics  in  that  they  have 
their  higher  rates  on  the  same  proceSs-or,  but  trial  1-3  is 
difficult  to  distinguish  from  trial  8  in  this  respect.  The  next 
section  will  show  trials  8  and  18  to  be  pathological  cases. 

The  general  conclusion  to  be  drawn  from  this  experiment  is 
that  while  FCFS  is  substantially  inferior  to  PS  when  the  program 
classes  are  complementary,  there  are  circumstances  in  which  it  is 
either  nearly  equivalent  or  actually  superior.  FCFS  can  not,  on 
the  basis  of  this  data,  be  ruled  out  as  a  "reasonable"  scheduling 
rule,  particularly  since  preemption  costs  have  been  ignored  in 
this  analysis.  P.s  previously  mentioned,  however,  the  exponential 
assumption,  with  its  relatively  low  variance,  works  in  favor  of 
FCvs.  A.lso,  FCFS  should  perform  less  well  as  more  programs  are 
added  to  the  system.  Later  sections  of  this  chapter  will  show 
that  certain  rules  of  the  preemptive  priority  family  are 
consistently  superior  to  both  PS  and  FCFS. 


5 . 2  Service  rate  bia sing  versus  processor  sharing 


Varying  the  two  scheduling  parameters  in  the  model  of 
preemptive  priority  family  rules  specifies  an  infinite  number  of 
different  schedulers.  Pecall  that  each  of  these  parameters, 
l((0,0),1)  and  g((1r1),1)  in  Figure  4.3,  may  take  on  values 
anywhere  in  the  closed  interval  [0,1].  The  former  parameter  may 
be  thought  of  as  the  priority  of  class  one  programs  at  processor 
zero;  the  latter  as  the  priority  of  class  one  programs  at 
processor  one. 


While  it  is  not  totally  justified  to  restrict  the  search  for 
"good"  schedules  to  only  the  parameter  values  of  disciplines  that 
have  some  intuitive  meaning,  only  a  few  of  the  possible  values 
for  these  parameters  seem  at  all  reasonable.  Aside  from  the  four 
cases  where  full  preemptive  priority  at  each  processor  is  given 
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to  some  program  class,  only  processor  sharing  and  service  rate 
biasing  fall  into  this  category.  In  Section  5.3  and  Appendix  A, 
justification  for  the  restricted  search  will  be  presented. 


Service  rate  biasing  is  a  nod  in  the  directioh  of  shortest 
processing  time  first  scheduling  and  is  the  first  rule  considered 
that  uses  burst  time  information. 


Definition: 
program 
rate  on 
use  the 
of  the  p 


The  service  rate  biasing  (SR3)  disci 
class  priority  at  a  processor  in  prop 
that  processor  relative  to  the  other 
processor,  or  in  inverse  proportion  t 
rocessor’s  power  that  it  consumes. 


pline  g 
ortion 
program 
o  the  p 


i  ves  a 
to  its 
s  that 
ercen tage 


For  example,  if  the  two  program  classes  of  the  model  had 
rates  of  5  and  10  at  a  processor,  they  would  be  assigned 
probabilities  of  .33  and  ..66  respectively. 

In  practical  terms,  it  is  difficult  to-  describe  what 
implementation  would  entail.  Bound-robin  processor  sched 
with  a  quantum  length  proportional  to  burst  rate  seems  li 
most  reasonable  way  to  view  it.  As  is  the  case  with  proc 
sharing,  an  implementation  that  closely  approximated  true 
rat?  biasing  would  require  an  unreasonable  number  of  pres 

As  discussed  in  the  introductory  chapter,  scheduling 
that  make  use  of  burst  time  information  become  increasing 
valuable  as  burst  time  variance  increases.  A  fundamental 
question  behind  all  such  disciplines  concerns  their  robus 
the  degree  to  which  burst  time  estimates  can  be  in  error 
performance  degrades  to  a  level  below  that  of  less  clevsr 


burs 

t 
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an  S 
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Is  service  rate  biasing  ever  superior  to  processor  sharing? 
If  so,  is  it  consistently  superior?  The  results  in  Table  5.2 
indicate  that  SBP  practiced  at  both  processors  gives  generally 
better  performance  with  respect  to  the  total  processor 
utilization  measure,  and  in  Section  5.3  it  will  be  seen  that 
except  for  three  cases,  SPB  does  very  nearly  optimally  well 
according  to  this  objective  function. 


There 
again)  in 


are,  however,  a  few  instances  (trials  8  and  18,  once 
which  PS  is  preferable  to  SRB.  In  these  two  trials 
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thial  no.  1 

RATE  SET!  ^QLj^} _ 

DEV,  01  1  1 

DEV,  II  1  1 


I  utilizations  -  %  I  BLOWUPS 

_ SCUEDULIiifi-fiyLE _ I  DEV,.0  DEV.l  TOTAI  I  CL.  1  CL.  2 

PURE  PROCESSOR  SHARING  66,66  66.66  66,66  1.5  1.5 

SERVICE  RATE  BIASING  66.66  66,66  66.66  1.5  1,5 


trial  no.  2 
rate  set: 

DEV,  01  100  1 

DEV.  II  1  1 

I  utilizations  -  %  I  BLOWUPS 

_ S£JjEDLiLIIi^RyL£ _ L^E^Q-Q£yj.i^QTAL-J.-XLiU_£Lx-£ 

PURE  PROCESSOR  SHARING  33.99  99,33  66.66  1.5  1.5 

SERVICE  RATE  BIASING  33.99  99.66  66.82  1.5  1.5 


TRIAL  NO.  3 

RATE  set:  _QLx_L_£Lt_2 

DEV.  01  100  100 

DEV,  11  1  1 

I  utilizations  -  %  I  BLOWUPS 
- S£aEI}!iLIfi£-RilL£ _ IJlU^JlLljuULQlAL^^L  ■ 

PURE  PROCESSOR  SHARING  0,99  99.99  50.49  2.0  2.0 

SERVICE  RATE  BIASING  0,99  99,99  50,49  2.0  2.0 


trial  no.  4 

RATE  set:  _CLx-.I_.ax_a 

DEV.  01  100  1 

DEV,  II  100  1 

I  utilizations  -  %  I  BLOWUPS 

- SCHEOaiEiaRyLL _ L_DElx£J2£:yxi_IOlAL_l_£Lx-i__ax_2 

PURE  PROCESSOR  SHARING  66,66  66.66  66.66  1.5  1.5 

SERVICE  RATE  BIASING  74.75  74,75  74.75  1.0  2,0 


trial  no.  5 

set:  __ax_j _ ax_a 

DEV.  01  100  1 

DEV,  II  1  100 

I  utilizations  -  %  I  BLOWUPS 

- SCtjEDaiNatiyLE _ L_DCiLd2_U£^xi_IClAL_i_CLx-i_-ax_2 

PURE  PROCESSOR  SHARING  98.07  98.07  98.07  1.0  1.0 

SERVICE  RATE  BIASING  99,00  99,00  99.00  1.0  1.0 


Table  5.2 


1 


TRIAL  NO.  6 

RATE  set:  _ CLx-2 

DEV,  01  100  100 

DEV.  II  100  1 

I  utilizations  -  %  I  BLOWUPS 

_ SCtlLDLiLlIii^RLiLt _ j:)£Vj.L,IQlAL-l-XLx-I^aa^ 

PURE  PROCESSOR  SHARING  33. 9Q  99,33  66.66  1.5  1,5 

SERVICE  RATE  BIASING  50.08  99.33  74,71  1.0  2.0 


trial  NO. 
RATE  set: 

7 

DEV.  Ol’ 

1 

100 

• 

DEV,  11 

10000 

1 

1  utilizations  -  %  1 

BLOWUPS 

_ SLtlEDLa.INS-EyL£ _ J2tyU-J[aiAL-l-£L.-JL--CLiu-2 

PURE  PROCESSOR  SHARING  99.98  98,03  99,00  1.0  1.0 

SERVICE  RATE  BIASING  99.99  99.00  99.49  1.0  1.0 


TRIAL  NO.  8 

RATE  SET:  CLjl_] _ 

DEV,  01  1  100 

DEV.  II  1  10000 

I  UTILIZATIONS  -  %  I  BLOWUPS 
_ §£ljEQLJLlNG^EL!L£_ _ LJ^ELi.5 

PURE  PROCESSOR  SHARING  99.33  33.99  66.66  1.5  1.5 

SERVICE  rate  BIASING  99.98  2.90  51,44  26.0  1.0 


trial  no.  9 

rate  set:  _ QLx.a 

DEV,  01  10000  100 

DEV.  II  1  1 

I  utilizations  -  %  I  BLOWUPS 
_ SOdEDULLIilG-EiiLE _ L_aE 

PURE  PROCESSOR  SHARING  0,50  99.99  50,25  2.0  2.0 

SERVICE  RATE  BIASING  0.50  99.99  50.25  2.0  2.0 


QLx-a 

100 
10000 

I  utilizations  -  %  I  BLOWUPS 

_ SttJEDULIiili^RyLE _ 

PURE  PROCESSOR  SHARING  99.99  0,99  50.49  2.0  2.0 

SERVICE  RATE  BIASING  99.99  0.99  50.49  50,5  1,0 


trial  NO,  10 

RATE  set:  _CLj_L 

DEV.  01  1 

DEV,  II  100 


Table  5.2  -  2 
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TRIAL  NO,  U 
RATE  set: 

DEV,  01  100  100 

DEV,  II  1  10000 

I  utilizations  -  %  I  BLOWUPS 

_ S£H£DULItl(2-£yLi; _ IJ^DUQ  J2£y  xl^fiLlAL  JL-CL  ■ 

PURE  PROCESSOR  SHARING  98.07  98.07  98,07  1.0  1,0 

SERVICE  RATE  BIASING  99.02  98.07  98. SS  1.0  1.0 


trial  no.  12 
RATE  set: 

DEV.  01  100  10000 

DEV.  II  100  1 

I  utilizations  -  %  I  BLOWUPS 

_ S£ljEDULI±i£-BiiL£ _ LJ2DLti>  J2£yU^aiAL_l_£L.._l«_£L^ 

PURE  PROCESSOR  SHARING  33.33  99,99  66.66  1.5  1.5 

SERVICE  RATE  BIASING  49.75  99,99  74,87  1,0  2.0 


TRIAL  NO.  13 

rate  set:  _£LiL_J _ QL^Z 

OEV.  01  1  10000 

DEV,  II  100  10000 

I  utilizations  -  %  I  BLOWUPS 

_ SQH£DLlLIN^RllL£__l_0£:^J^£y4.i_I0lAL_l_£:Lt^— 

PURE  PROCESSOR  SHAPING  99.33  33.99  66.66  1.5  1.5 

SERVICE  RATE  BIASING  99,50  50,49  74.99  2.0  1,0 


trial  no.  14 

rate  set:  _ .QLju^Z 

DEV,  01  1  10000 

DEV,  II  10000  100 

I  utilizations  -  %  I  BLOWUPS 
_ ^Q^L£tULlU(L.BllLL _ L_QEXtja 

PURE  PROCESSOR  SHARING  99.98  98.03  99.00  1.0  1.0 

SERVICE  RATE  BIASING  99.99  99.00  99,49  1.0  1,0 


trial  no.  15 

RATE  SET:  _£Lx_J _ CLx_2 

DEV.  01  10000  10000 

DEV.  II  1  100 

I  utilizations  -  %  I  BLOWUPS 

- SQJEQULim.RyL£ _ L_aDM)  J2£y^_r(2lAL_l_£LxU_-.CLx_2 

PURE  PROCESSOR  SHARING  0,50  99.99  50.25  2.0  2.0 

SERVICE  RATE  BIASING  0,98  99.99  50,49  51.0  1.0 


Table  5.2 
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trial  NO.  16 
RATE  set: 

OEV.  01 
DEV,  II 

I  utilizations  -  %  I  BLOWUPS 

_ SCL!E:QULItl£-.RUL£; _ 

PURE  PROCESSOR  SHARING  99.99  0,00  50,00  2.0  2,0 

SERVICE  RATE  BIASING  99.99  0.00  50.00  100.0  1.0 


1  100 
10000  1000000 


TRIAL  NO.  17 

rate  set: 

DEV,  01  10000  100 

OEV.  II  1  1000000 

I  utilizations  -  %  I  BLOWUPS 
_ S£HE.DLLLINi2-£yL£ _ 

PURE  PROCESSOR  SHARING  99.98  99,98  99,98  1.0  1.0 

SERVICE  RATE  BIASING  99.99  99.98  99,98  1.0  1,0 


TRIAL  NO.  18 

RATE  set:  _ QL^Z 

DEV.  01  100  10000 

DEV.  II  1  1000000 

I  UTILIZATIONS  -  %  I  BLOWUPS 

_ _ L^EX«^-££y^-j:aXAL-l^L  t.-I^£Lx-Z 

PURE  PROCESSOR  SHARING  98.07  98,07  98.07  1.0  1.0 

SERVICE  rate  BIASING  99.34  66.77  83.05  1.5  1.0 


TRIAL  NO.  19 

RATE  set:  _ QLjl^Z 

DEV,  01  100  1 

DEV.  II  10000  1000000 

I  utilizations  -  %  I  BLOWUPS 

_ S£h!EQiiLIil£_RyL£ _ 

PURE  PROCESSOR  SHAPING  99.99  0,49  50.24  2.0  2.0 

SERVICE  RATE  BIASING  99.99  0,98  50.49  1,0  51.0 


trial  NO.  20 

RATE  SET:  £LLx_1~CLju_2 

OEV.  01  10000  1 

DEV.  II  100  1000000 

I  utilizations  -  %  I  BLOWUPS 

_ S£JHEnULL!i^RyL£ _ J^iLls^-LQlAL^^L  t 

PURE  PROCESSOR  SHARING  99.99  98.03  99.01  1,0  1.0 

SERVICE  RATE  BIASING  99.99  99,00  99.50  1,0  1,0 


Table  5.2 
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TRIAL  no.  21 
RATE  set: 


_ CLju-a 

100  1000000 
1  10000 


OEV.  01 
DEV.  II 


I  utilizations  -  %  I  BLOWUPS 

_ §£a£DULl!iiS-.RyL£ _ L_DE^^Q  D_EV^1  TOTAL  I  Cl  .  1  CLi  2 

PURE  PROCESSOR  SHARING  0.99  99.99  50.49  2.0  2.0 

SERVICE  RATE  BIASING  0.99  99.99  50.49  99.0  1.0 

Table  5.2-5 

performance  of  scheduling  rules  declines  as  their  use  of  burst 
time  inf orm.a tion  increases.  Tn  other  words,  FCPS  is  superior  to 
PS,  which  in  turn  is  superior  to  SEP.  As  mentioned  in  the 
previous  section  there  is  no  obvious  explanation  for  this 
anomaly,  which  will  appear  once  again  in  Section  5.3. 

Perhaps  the  most  striking  thing  about  the  data  in  Table  5.2 
is  that  in  a  number  of  instances  where  SEE  yields  at  least  as 
high  a  total  utilisation  figure  as  PS,  the  "average-'  blowup 
factor  is  significantly  higher  under  SEE.  In  other  words,  one 
program  class  is  being  penalised  substantially  in  order  to  obtain 
only  slightly  higher  utilization.  This  subject  will  be  discussed 
in  Chapter  Six. 

lull 

Service  rate  biasing  appears  to  be  generally  superior  to 
processor  sharing  with  respect  to  total  processor  utilization. 
A.ddit  iona lly ,  it  might  be  implemented  with  a  lower  preemption 
frequency.  Por  both  rules,  however,  the  preemption  rate  may  be 
prohibitively  high,  and  it  rises  as  the  discrete  approximation  to 
the  rule  becomes  closer. 

Scheduling  rules  that  reauire  preemption  no  more  frequently 
than  upon  program  arrivals  fall  within  the  "reasonable"  realm 
from  the  point  of  view  of  preemption  cost.  FCES  and  the  non- 
preemptive  priority  disciplines  obviously  fit  into  this  category. 
A.mong  the  rules  in  the  oreeraptive  priority  family,  the  full 
priority  disciplines  also  satisfy  the  criterion. 
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Definition:  The  full  Rriorit^  subfamily  of  rules  allocates  the 

processor,  in  every  specific  conflict  situation,  to  one  and 
only  one  program  class. 

Schelules  and  scheduling  rules  of  the  preemptive  priority  family 
that  are  not  full  priority  will  be  referred  to  as  partial 
priority. 

In  the  notation  of  Figure  4.3,  the  full  priority  schedules  at 
processor  zero  and  processor  one  correspond  respectively  to 
q((0,0),1)  and  g((1,1),1)  independently  taking  on  the  values  zero 
or  one.  Considering  both  processors  at  once,  there  are  obviously 
four  distinct  pairs  of  full  priority  schedules  for  this  system. 

Notation:  '^’or  the  system  with  two  processors  and  two  program 

classes,  the  pairs  of  full  priority  schedules  will  be  denoted 
by  the  pair  indicating  that  program  class  i  has  full 

preemptive  priority  at  processor  zero  and  program  class  j  has 
full  preemptive  priority  at  processor  one. 

Recall  from  Chapter  One  the  following  pair  of  definitions: 

Definition:  A  schedule  indicates  which  member  of  a  queue  should 

be  receiving  service  at  a  particular  instant  in  time. 

Definition:  P.  scheduling  rule  or  discipline  is  an  algorithm 

accordina  to  which  a  schedule  is  selected. 
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in  the  P'RTOrity  columns  indicates  that  SPB  is  practiced  there. 

■'^’or  the  time  being,  the  letters  P,  S  and  B  which  occasianally 
appear  between  the  values  of  the  scheduling  parameters  may  ba 
ignored . 

The  most  important  observation  to  be  made  concerning  Table 
5.3  is  that  in  each  of  the  twenty-one  trials  there  is  a  pair  of 
full  priority  schedules  (one  for  each  processor)  yielding  total 
processor  utilization  at  least  as  high  as  any  combination  that 
uses  service  rate  biasing.  Reference  to  Table  5.1  shows  that 
even  in  the  pathological  cases,  trials  8  and  18,  there  is  a  full 
priority  combination  that  is  superior  to  FCFS. 

It  would  be  interesting  to  prove  the  conjecture  that  in  a 
system  of  two  processors  and  two  program  classes  as  considered  in 
this  thesis,  a  pair  of  full  priority  schedules  is  sufficient  to 
achieve  maxi  mum  total  processor  utilization,  at  least  among 
schedules  from  the  preemptive  prioritv  family.  "^he  next  several 
paragraphs  provide  motivation  for  this  proof,  which  appears  in 
Appendix  A. 

The  state- transit  ion  model  of  the  system  under  consideration 
C^igure  4.3)  has  four  states.  It  should  be  obvious  that  the 
utilization  of  processor  zero  is  equal  to  Brf  (0,0)  ]  +  Pr^  (0,1)  ]  + 
PrT  (1,0)  ],  and  that  the  utilization  of  processor  one  is  equal  to 
Prf  (0,1)  1  +  Prf  (1,0)  ]  t  pr[  (1,1)  ].  Total  processor  utilization 
was  defined  to  be  the  average  of  these  two,  or  (1  +  PrT  (0,1)  ]  t 
^r[  (1,0)  1  )/2. 

The  steady-state  probabilities  in  the  above  expression  are 
obtained  by  solving  the  balance  equations  of  the  model,  as 
discussed  in  Section  4.3.  These  balance  equations,  and  thus  the 
resultant  expression  for  total  processor  utilization,  contain  six 
variables:  the  burst  rate  of  each  program  class  at  each 

processor,  and  the  two  scheduling  parameters. 

Siven  this  formulation,  the  proof  becomes  a  constrained 
optimization  problem:  Show  that  for  any  positive  rate  values, 
the  total  processor  utilization  function  is  maximized  in  one  of 
the  four  cases  where  the  two  scheduling  parameters,  constrained 
to  the  closed  interval  f  0,1],  take  on  integer  values.  Recall 
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TRIAL  NO.  1 

«ATE  set:  ^QLm^I _ CLx_2 

OEV,  01  1  1 

HEV,  II  1  1 

scheduling  parameters  IPRIO.I  utilizations  -  %  I  BLOWUPS 

aiJL£±0Liil,QlIIUirlIiJL-iI-l_LJ2£^£_QL^l^IPIAL-.l-a^J _ QLx_2 


0.500 

0.500 

0 

0 

66 . 66 

66 .66 

66 .66 

1.5 

1.5 

0.500 

1.000 

0 

1 

66 .66 

66 .66 

66.66 

1.0 

2.0 

0.500 

0.000 

0 

2 

66.66 

66 . 66 

66.66 

2.0 

1.0 

1.000 

0.500 

1 

0 

66.66 

66 . 66 

66 . 66 

1.0 

2.0 

1.000 

1.000 

1 

1 

66 . 66 

66 . 66 

66.66 

1.0 

3.0 

1.000 

0.000 

1 

2 

66,66 

66 . 66 

66.66 

1.5 

1.5 

0.000 

0.600 

0 

66.66 

66 . 66 

66.66 

2.0 

1.0 

0.000 

1.000 

2 

1 

66 . 66 

66 . 66 

66 . 66 

1.5 

1.5 

0.000 

0.000 

2 

2 

66.66 

66.66 

66,66 

3.0 

1.0 

TRIAL  NO. 
RATE  SET: 

scheduling 

Qi(aii2UiI 

2 

^QL 

DEV.  01 

DEV.  11 

PARAMETERS 
Q(n»l)_f  1)  ) 

_ 

100 

1 

IPRIO. 
1  0  1 

1 

1 

1  utilizations  -  %  1 

l_D_EV.O  DEV.l  TOTAL  1 

BLOWUPS 

CL^1 

0.990 

0.500 

0 

0 

33.99 

99.66 

66.62 

1.5 

1.5 

0.990 

1.000 

0 

1 

1  .96 

99. 9B 

50.97 

1.0 

51.5 

0.990 

0.000 

0 

2 

50.24 

99.49 

74.87 

2.0 

1.0 

1.000 

0.500 

1 

0 

33.99 

99.66 

66.82 

1.5 

1.5 

1.000 

1.000 

1 

1 

1.96 

99.98 

50.97 

1.0 

51.5 

1.000  R 

0.000 

1 

2 

50.24 

99.50 

74.87 

2.0 

1.0 

0.000 

0.500 

2 

0 

33.99 

83.00 

58,49 

2.0 

1.5 

0.000 

1.000 

2 

1 

1.96 

99.01 

50.49 

1.0 

51.0 

0.000 

0.000 

2 

2 

50.24 

74.87 

62.56 

4.0 

1.0 

trial  no.  3 

RATE  set:  _ 

DEV.  01  100  100 

DEV.  II  1  1 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  ^  I  BLOWUPS 


QiiaifiUii 

_QiIUUUUi_lL.I_L_i2£LiulL.QL^JLj:SIAL»J_£Lx_J _ 

0.500 

0.500 

0 

0 

0,99 

99.99 

50.49 

2.0 

2.0 

0.500 

1.000 

0 

1 

0.99 

99.99 

50.49 

1.0 

100.5 

0,500 

0.000 

0 

2 

0.99 

99.99 

50.49 

100.5 

1.0 

1.000 

0.500 

1 

0 

0.99 

99.99 

50.49 

2.0 

2.0 

1.000 

1.000 

1 

1 

0.99 

99.99 

50.49 

1.0 

101.0 

1.000 

0.000 

1 

2 

0.99 

99.99 

50.49 

100,0 

1.0 

0.000 

0.500 

2 

0 

0.99 

99.99 

50.49 

2.0 

2.0 

0.000 

1.000 

2 

1 

0.99 

99.99 

50.49 

1.0 

100.0 

0.000 

0.000 

2 

2 

0.99 

99.99 

50.49 

101.0 

1.0 

Table  5.3 
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TRIAL  NO.  4 

rate  set:  _CLx_J _ 

dev.  01  100  1 

DEV.  II  100  1 


scheduling  parameters  IPRIO.I  utilizations  -  5^  I  BLOWUPS 


oiioiliun 

U  UU  ii  U  jL_iL- i_ 

.XL^i _ cl 

0.990 

0.990 

0 

0 

74.75 

74.75 

74,75 

1.0 

2.0 

0.990 

1.000 

0 

1 

74.81 

74.81 

74.81 

1.0 

2.0 

0.990 

0.000 

0 

2 

66.55 

66  •  55 

66.55 

1.5 

1.5 

1.000 

0.990 

1 

0 

74.81 

74.81 

74,81 

1.0 

2.0 

1.000 

S  1.000 

1 

1 

74.87 

74.87 

74,87 

1.0 

2.0 

1  .000 

0.000 

1 

2 

66.66 

66.66 

66,66 

1.5 

1.5 

0.000 

0.990 

0 

66 . 55 

66.55 

66 . 55 

1.5 

1.5 

0.000 

1.000 

2 

1 

66 . 66 

66.66 

66.66 

1.5 

1.5 

0.000 

0.000 

2 

2 

5  0.49 

50.49 

50,49 

102.0 

1.0 

trial  no. 
RATE  SET; 

5 

cL^_a 

DEV.  01 

100 

1 

DEV.  11 

1 

100 

SCHEDULING 

parameters 

1  prig. 

1  utilizations  -  % 

1  BLOWUPS 

QliOiiiliii 

QllIiD  fllll 

1  QEV...0 

DE  V^l  . 

total. 

1  CL.  1  CL 

.  ^ 

0.990 

0.009 

0 

0 

99.00 

90.00 

09.00 

1.0 

1.0 

0,990 

1.000 

0 

1 

33.99 

99.33 

66.66 

1,0 

3.0 

0.990 

0.000 

0 

2 

99.01 

99.00 

90,01 

1.0 

1.0 

1.000 

0.009 

1 

0 

90.00 

99.01 

90.01 

1.0 

1.0 

1.000 

1.000 

1 

1 

33.99 

99.33 

66 , 66 

1.0 

3.0 

1.000 

8  0.000 

1 

2 

99.01 

90,01 

99.01 

1.0 

1.0 

0.000 

0.009 

2 

0 

99.33 

33.99 

66.66 

3.0 

1.0 

0.000 

1.000 

2 

1 

50.24 

50,24 

50,24 

2.0 

2.0 

0.000 

0.000 

2 

2 

90.33 

33.99 

66,66 

3.0 

1.0 

TRIAL  NO.  4 

«ate  set:  _£La._i„__aLx_a 

DEV.  01  100  100 

DEV.  II  100  1 


scheduling  parameters  IPRIO.I  UTILIZATIONS  -  ‘ih  I  BLOWUPS 


Qiiaiouu 

DFV.O 

.DFV^i, 

.JOIAL. 

.JL CL^JL 

._£L*_2 

0.500 

0,990 

0 

0 

50.08 

99,33 

74.71 

1.0 

2.0 

0,500 

1.000 

0 

1 

50.33 

99.33 

74.83 

1.0 

2.0 

0.500 

0,000 

0 

2 

1.64 

99.33 

50.49 

76.0 

1.0 

1.000 

0.990 

1 

0 

50.24 

90,01 

74,63 

1.0 

2.0 

1.000 

1.000 

1 

1 

50.49 

99.01 

74.75 

1.0 

2.0 

1.000 

0,000 

1 

2 

1.96 

99.01 

50.49 

51.0 

1.0 

0,000 

0,990 

2 

0 

50,00 

99.50 

74.75 

1.0 

2.0 

0.000 

R  1.000 

2 

1 

50.24 

99,50 

74.87 

UO 

2.0 

0.000 

0.000 

2 

2 

1  .48 

99.50 

50.49 

101.5 

1.0 

Table  S.3  -  2 
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TRIAL  NO.  7 

rate  set:  ^QL^l _ 

OEV.  01  1  100 

DEV.  II  10000  1 


scheduling  parameters  IPRIO.I  utilizations  -  ^  I  BLOWUPS 


QIiaxilLiil 

0  tll*il» llll 

0 

1  1 

1 DEV.O 

DEV.l 

TOTAL  1,  CL 

_ CL^_a 

0.009 

0.999 

0 

0 

99.99 

99.00 

99.49 

1.0 

1.0 

0,009 

1.000 

0 

1 

99.99 

99,00 

99.49 

1.0 

1.0 

0,009 

0,000 

0 

2 

50.25 

99.00 

74,62 

2.0 

1.0 

1.000 

0,999 

1 

0 

90.99 

0.99 

50.49 

1.0 

101.0 

1.000 

1.000 

1 

1 

99,99 

0.99 

50.49 

1.0 

101.0 

1,000 

0.000 

1 

2 

99.00 

0.99 

50.00 

1.0 

100.0 

0,000 

0.999 

2 

0 

90.99 

99.00 

99,60 

1.0 

1.0 

0,000  1 

B  1.000 

2 

1 

99,99 

99.00 

99,50 

1.0 

1.0 

0,000 

0,000 

2 

2 

50.24 

99.01 

74,63 

2.0 

1.0 

trial  NO.  B 
RATE  set; 

DEV.  01  1  100 

DEV,  II  1  10000 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


QiiUU 

riUi_0_ 

.i  1 

OCVxO 

.TOiaL_i_£a.*_i _ 

0.009 

0.000 

0 

0 

99.98 

2.90 

51.44 

26.0 

1.0 

0.009 

1.000 

0 

1 

98.08 

2.88 

50.48 

26,0 

1.0 

0.009 

0,000 

0 

2 

99.98 

2.90 

51 .44 

26,0 

1.0 

1.000 

0.000 

1 

0 

99,50 

50.24 

74.87 

1,0 

2.0 

1,000 

1.000 

1 

1 

50,49 

50.00 

50.24 

1.0 

200.0 

1,000 

R  0.000 

1 

2 

99.50 

50.24 

74. b7 

1.0 

2.0 

0.000 

0,000 

2 

0 

99,99 

1  .97 

50,98 

51.0 

1.0 

0,000 

1.000 

2 

1 

99.01 

1  .96 

50,49 

51.0 

1.0 

0,000 

S  0.000 

2 

2 

99.99 

1.97 

50.98 

51.0 

1.0 

trial  NO,  9 

rate  SET:  _ QLju^Z 

OEV.  01  10000  100 

OEV.  11  1  1 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


ailOiHIin 

QUUUiiUi 

(L 

>I2£V^£L 

.IOTiiL_i_CL*_l _ 

0.990 

0,600 

0 

0 

0,50 

99.99 

50.25 

2.0 

2.0 

0.990 

1.000 

0 

1 

0.01 

99.99 

50,00 

1.0 

'0,990 

0.000 

0 

2 

0.99 

99.99 

50.49 

101.0 

1.0 

1.000 

0.500 

1 

0 

0.50 

99.99 

50,25 

2.0 

2.0 

1.000 

1.000 

1 

1 

0,01 

99.99 

50.00 

1.0 

1.000 

R  0.000 

1 

2 

0.99 

99.99 

50,49 

101,0 

1.0 

0,000 

0.500 

2 

0 

0.50 

99.99 

50.24 

2.0 

2.0 

0.000 

1.000 

2 

1 

0.01 

99.99 

50.00 

1.0 

0.000 

0.00  0 

2 

2 

0.99 

99.99 

50,49 

102.0 

1.0 

'^able  5 . 3 
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TRIAL  NO.  10 

RATE  SET:  _ QLj..Z 

DEV.  01  1  100 

DEV.  II  100  10000 

SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


0.009 

0.009 

0 

0 

99.99 

0.99 

50.49 

50.5 

1.0 

0.009 

1.000 

0 

1 

99.98 

0.99 

50.49 

50.5 

1.0 

0.009 

0.000 

0 

2 

99.99 

0.99 

50,49 

50,5 

1.0 

1,000 

0.009 

1 

0 

99.99 

0.99 

50.49 

1.0 

100.0 

1.000 

1,000 

1 

1 

99.50 

0.99 

50.24 

1.0 

200.0 

1.000 

0,000 

1 

2 

99.99 

0.99 

50.49 

1.0 

100.0 

0,000 

0,009 

2 

0 

99.99 

0.99 

50.49 

100,0 

1.0 

0.000 

1.000 

2 

1 

90.99 

0.99 

50.49 

100.0 

1.0 

0,000 

S  0.000 

2 

2 

99.99 

0.99 

50,49 

100,0 

1.0 

TRIAL  NO.  11 

RATE  SET:  _ CLjl_2 

DEV.  01  100  100 

DEV,  II  1  10000 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


Q ( (OiOlil) 

0(  (Ifl)  .  1)  )  1 

0 

1  1 

1  DEV^O 

DEV.  1 

TOTAL  1  CL 

^1 _ CL._2 

0.500 

0.000 

0 

0 

99.02 

98.07 

98.55 

1.0 

1.0 

0.500 

1.000 

0 

1 

2.R9 

98,07 

50.48 

1.0 

51.5 

0.500 

0.000 

0 

2 

99.02 

98.07 

98.55 

1.0 

1.0 

1.000 

0.000 

1 

0 

99.01 

9Q.01 

99.01 

1.0 

1.0 

1.000 

1.000 

1 

1 

1.96 

99.01 

50.49 

1.0 

102,0 

1.000  r 

?  0.000 

1 

2 

99.01 

99.01 

99.01 

1.0 

1.0 

0.000 

0.000 

2 

0 

99.50 

50,24 

74.87 

2.0 

1.0 

0.000 

1.000 

2 

1 

50.24 

50.24 

50.24 

2.0 

2.0 

0.000 

0.000 

2 

2 

99.50 

50.24 

74.87 

2.0 

1.0 

trial  no.  1? 

RATE  SET:  _ Cl 

DEV.  01  100  10000 

DEV.  II  100  1 


scheduling  parameters  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


-QilUililUi 

0  1 

1 I»EV.0 

DEV.  1 

total 

1  CL.  1 

CL.2 

0.009 

0.990 

0 

0 

49.75 

99.99 

74,87 

1.0 

2.0 

0,009 

1.000 

0 

1 

50.00 

99.99 

74,99 

1.0 

2.0 

0.009 

0.000 

0 

2 

0.01 

99.99 

50.00 

1.0 

1,000 

0.990 

1 

0 

49.75 

99.50 

74,62 

1.0 

2.0 

1.000 

1  .000 

1 

1 

50.00 

99.50 

74.75 

1.0 

2.0 

1.000 

0.000 

1 

2 

0,01 

99.99 

50.00 

1.0 

0.000 

0,990 

2 

0 

49.75 

99.99 

74,87 

1.0 

2.0 

0.000 

B  1.000 

2 

1 

50.00 

99.99 

74.99 

1.0 

2.0 

0.000 

0.000 

2 

2 

0.01 

99.99 

50.00 

1.0 

Table  5.3 
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TRIAL  NO.  13 

rate  set;  ^QLs^l _ QL^Z 

OEV.  01  1  10000 

DEV.  II  100  10000 


scheduling  parameters  IPRIO.I  utilizations  -  %  I  BLOWUPS 


Q(  (0Alllj.li 

Q(ii».ii»n  11 

0 

1  1 

1  DEV^O 

DEV.l 

TOTAL 

1  CL.  1 

CL.2 

0.000 

0.009 

0 

0 

99.50 

50.49 

74.99 

2.0 

1.0 

0,000 

1.000 

0 

1 

99.50 

50.24 

74.87 

2.0 

l.O 

0,000 

0.000 

0 

2 

99,50 

50.49 

74.99 

2.0 

1.0 

1.000 

0.009 

1 

0 

99.01 

1.95 

50.48 

1.0 

51.5 

1.000 

1,000 

1 

1 

99.01 

0.99 

50.00 

1.0 

1.000 

0.000 

1 

2 

99.01 

1.96 

50.49 

1.0 

51.0 

0.000 

0.009 

d 

0 

99.50 

50.49 

74.99 

2.0 

1.0 

0,000 

R  1.000 

2 

1 

99.50 

50,24 

74.87 

2.0 

1.0 

0.000 

S  0.000 

2 

2 

99.50 

50.49 

74.99 

2.0 

1.0 

TRIAL  NO.  14 

rate  set:  _ CL^a 

OEV.  01  1  lOOOO 

DEV.  II  10000  100 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


ailUQ.l±ll 

QlILiUiJLIlL 

(L 

-1  J 

L-nE^iL.Q£:^^l._I^iTAL-L-£L 

_ CL»_2 

0.000 

0.990 

0 

0 

99.99 

99.00 

99.49 

1.0 

1.0 

0.000 

1.000 

0 

1 

99.99 

99.00 

90.50 

1.0 

1  .0 

0.000 

0.000 

0 

2 

98.05 

99,01 

98.53 

1.0 

1.0 

1.000 

0.990 

1 

0 

99,99 

0,50 

50.25 

1,0 

199,0 

1.000 

1.000 

1 

1 

99.99 

0.50 

50.25 

1.0 

199.0 

1.000 

0.000 

1 

2 

99.00 

0.99 

50,00 

1.0 

100.0 

0.000 

0.990 

2 

0 

90.99 

99.00 

99.50 

1.0 

1.0 

0.000 

B  1,000 

2 

1 

99.99 

99.00 

99,50 

1.0 

1.0 

0.000 

0,000 

2 

2 

98.05 

99.01 

98.53 

1.0 

1.0 

TRIAL  NO.  IS 

rate  SET:  _ _ CL^  Z 

DEV.  01  10000  10000 
DEV.  II  1  100 

SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


0.500 

0.009 

0 

0 

0.98 

99.99 

50.49 

51.0 

1.0 

0.500 

1.000 

0 

1 

0.01 

99.99 

50.00 

1.0 

0.500 

0.000 

0 

2 

0.99 

99,99 

50.49 

101.0 

1.0 

1.000 

0.009 

1 

0 

0.98 

99.99 

50.49 

51,0 

1.0 

1.000 

1.000 

1 

1 

0.01 

99.99 

50.00 

1.0 

1.000 

R  0.000 

1 

2 

0.99 

99.99 

50.49 

101.0 

1.0 

0,000 

0,009 

2 

0 

0.98 

99.99 

50.49 

51.0 

1.0 

0,000 

1,000 

2 

1 

0.01 

99.99 

50.00 

1.0 

0.000 

0.000 

2 

2 

0.99 

99,99 

50.49 

101.0 

1.0 

Table  5.3  -  5 


TRIAL  NO.  16 

RATF  SET:  _ 

OEV,  01  1  100 

DEV,  II  10000  1000000 

scheduling  parameters  IPRIO.I  utilizations  -  %  I  BLOWUPS 


ail0iili-tlI_QiIUUiJ.liL-Q-l-LJ2£^^QL£lil.-IoiAL_J-£Lx,l _ QLx-2 


0.009 

0.009 

0 

0 

9Q.99 

0.00 

SO. 00 

100,0 

1.0 

0.009 

1.000 

0 

1 

9q,99 

0,00 

50.00 

100.0 

1.0 

0,009 

0.000 

0 

2 

99.99 

0,00 

50,00 

100.0 

1.0 

1.000 

0,009 

1 

0 

99.99 

0.00 

50,00 

1.0 

1.000 

1,000 

1 

1 

99.99 

0.00 

50.00 

1.0 

1.000 

0.000 

1 

2 

99.99 

0.00 

50.00 

1.0 

***** 

0.000 

0.009 

2 

0 

99,99 

0.00 

50,00 

1.0 

0.000 

1,000 

2 

1 

99,99 

0,00 

50.00 

1.0 

0,000 

S  0,000 

2 

2 

99.99 

0.00 

50.00 

1.0 

TRIAL  NO,  17 

rate  set:  _ QLjl^Z 

PEV,  01  10000  100 

OEV.  II  1  1000000 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


(OiOUii 

Q{il»ll»l)  )  1 

0 

1  1 

1  QEV^.0 

DEV.l 

TOTAL  1 

CL.  1 

CL.2 

0.990 

0.000 

0 

0 

99.99 

99,98 

99.98 

1.0 

1.0 

0.990 

1.000 

0 

1 

0.99 

99.99 

50.49 

1.0 

102.0 

0.990 

0.000 

0 

2 

99.99 

99.98 

99.98 

1.0 

1.0 

1.000 

0.000 

1 

0 

99.99 

99.99 

99.99 

1.0 

1.0 

1.000 

1.000 

1 

1 

0.99 

99,99 

50,49 

1.0 

102,0 

1.000 

9  0,000 

1 

2 

99.99 

99,99 

90,99 

1.0 

1,0 

0.000 

0.000 

2 

0 

99. 99 

49.75 

74.87 

2.0 

1.0 

0.000 

1,000 

2 

1 

50.00 

50.00 

50.00 

2.0 

2.0 

0.000 

0.000 

2 

2 

99.99 

49.75 

74.87 

2.0 

1.0 

trial  no.  Ifl 

rate  set:  _ CL.  g 

OEV.  01  100  10000 

DEV,  II  1  1000000 


SCHEDULING  PARAMETERS  IPRIO.I  UTILIZATIONS  -  %  I  BLOWUPS 


fiiiOiilLtil 

QllUiliiliL 

0 

.i  J 

L_n£^o 

x_l _ CL^_2 

0.009 

0 .000 

0 

0 

99,34 

66.77 

83.05 

1.5 

1.0 

0.009 

1.000 

0 

1 

33.77 

66.55 

50.16 

1.5 

3,0 

0.009 

0,000 

0 

2 

99.34 

66.77 

83.05 

1.5 

1.0 

1.000 

0.000 

1 

0 

99.01 

99.01 

99.01 

1.0 

1.0 

1.000 

1.000 

1 

1 

0.99 

99.00 

50.00 

1.0 

*«■»** 

1.000 

R  0,000 

1 

2 

99.01 

99.01 

90,01 

1.0 

1.0 

0.000 

0.000 

2 

0 

99.50 

50.49 

74.99 

2.0 

1.0 

0.000 

1.000 

2 

1 

50.24 

50.24 

50,24 

2.0 

2.0 

0.000 

S  0.000 

2 

2 

90.50 

50.49 

74.99 

2.0 

1.0 

able  5.3 
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TRIAL  NO.  lu 

rate  set:  __C.LiL_J _ QJL^.Z 

OFV.  ul  lot)  1 

OEV.  II  10000  1000000 


scheduling  parameters  IPRIO.j  utilizations  -  %  I  BLOWUPS 


X0i9.LtiI 

n.dijll.-  Ill  1 

0 

1 

1 DEV.O 

DEV.l 

TOTAL  1  CL^  I  CL.._i: 

0.990 

0 .009 

0 

0 

9P.99 

0.98 

50,49 

1.0 

51.0 

0,990 

1.000 

0 

1 

99,99 

0.9R 

50.49 

1.0 

51.0 

0.990 

0.000 

0 

2 

99.99 

0.98 

50.49 

1.0 

51.0 

1.000 

0,009 

1 

0 

99.99 

0.99 

50.49 

1.0 

101.0 

1.000 

1.000 

1 

1 

90.99 

0.99 

50.49 

1.0 

101.0 

1,000  H 

0.000 

1 

2 

99,99 

0.99 

50,49 

1.0 

101.0 

o.oco 

0.009 

2 

0 

99.99 

0.00 

50.00 

1.0 

0.000 

1  .000 

2 

1 

99.99 

0.00 

50.00 

1.0 

0.000 

0.000 

2 

2 

99.99 

0.00 

50.00 

1.0 

trial  no.  ^0 

rate  set:  _ 

n£v.  01  10000  1 

[lEV.  II  1  00  1  000000 


scheduling  parameters  IPRIO.I  utilizations  -  %  I  BLOWUPS 


QilliOliil 

riliJ-l- 

0.999 

0.000 

0 

0 

99,99 

09.00 

90,50 

1.0 

1.0 

0,999 

1.000 

0 

1 

90,00 

99.00 

Oo,00 

1.0 

1  .0 

0.999 

0.000 

0 

2 

99.99 

99.00 

99.50 

1.0 

1.0 

1.000 

0,000 

1 

0 

99.99 

99.00 

99.50 

1.0 

1.0 

1 .000 

1.000 

1 

1 

99.00 

99,00 

90.00 

1.0 

1.0 

1.000 

R  0.000 

1 

2 

99.99 

99.00 

90.50 

1.0 

1.0 

0.000 

0.^00 

2 

0 

99.99 

0.00 

50.00 

1.0 

0.000 

1.000 

2 

1 

•  99,99 

0.01 

50.00 

1.0 

0.000 

0.000 

2 

2 

99.99 

0.00 

o 

. 

o 

o 

1.0 

trial  no.  ?1 

PATE  SET:  _ QLjl.Z 

UEV.  01  100  loorono 

OEV.  II  1  10000 


SCHEDULING  PAPAME'^f'.RS  IPRIO.I  ( U  I !_  T  Z  A  T  I ONS  -  ^  I  BLOWUPS 


QliOiOijtii 

_yi,j 

,iUL„D„ 

.D.,LX.tJL.IOIXL_ 

0.000 

0  .  (  0  0 

0 

0 

0 , 99 

9^-  ,99 

50,49 

99.0 

1.0 

0.000 

1  .  0  0 

0 

1 

n  .'-^9 

9c  ,9  ? 

50,49 

1.0 

100.0 

0.000 

0 . 0  0  0 

0 

2 

0,99 

^9,99 

50.49 

100.0 

1.0 

1.000 

0  .  0  0 

1 

0 

0.99 

99.99 

50,49 

99.0 

1.0 

1.000 

1  ,  ’)0  0 

1 

1 

0 ,9^^ 

9  9,01 

E0,00 

1.0 

1.000 

0  .^00 

1 

2 

0.99 

up  ,93 

50,49 

100.0 

1.0 

0,000 

0  .  0  0 

2 

0 

r  .99 

pc-.  99 

50.49 

99.0 

1.0 

0.000 

1  ,  0  0  0 

2 

1 

0.99 

9^5 .99 

50,49 

1.0 

100.0 

0.000 

S  0  .  (...  r.  0 

2 

2 

0.99 

9  P.99 

50.49 

100.0 

1.0 

’  T 

S.3  - 
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tb?.t  thesf^  two  srhodulina  parametprs,  q{(0,0),1)  and  q((1r1)r1)r 
reprasent  rpsp<^ct i vel y  the  priority  of  the  class  one  program  at 
orocessors  zero  and  one. 

i. 

Consider  the  four  rates  to  be  fixed,  and  view  total  processor 
u-^ilization  as  a  surface  over  the  (  g((0,0),1)  ,  ) 

plane.  If  the  function  is  monotone  in  each  scheduling  parameter, 
then  certainly  maximum  utilization  occurs  at  one  of  the  four 
corners  of  the  plane,  and  some  pair  of  full  priority  schedules  is 
optimal.  To  prove  the  conjecture  in  general,  it  is  necessary  to 
demonstrate  that  the  function  is  monotone  in  each  scheduling 
parameter  for  all  possible  rate  values.  This  is  done  in  Appendix 
A. 

As  an  aside,  it  is  interesting  to  note  that  service  rate 
biasing  performs  very  nearly  optimally  well  in  all  but  three 
trials:  2,  8  and  18.  Trial  2  thus  joins  the  list  of  rate  sets 

that  result  in  unusual  behavior. 

8.^  "Good"  full  2]ll2§ 

In  Section  5.3  and  Appendix  A,  it  is  proven  that  some  pair  of 
full  priority  schedules  is  optimal  with  respect  to  total 
processor  utilization  in  a  2P2C  system.  For  any  particular  set 
of  burst  rates,  this  has  the  effect  of  narrowing  the  search  for 
an  optimal  schedule  pair  from  the  infinite  space  of  preemptive 
priority  schedules  to  the  four  full  priority  pairs.  The  next 
question  to  ask  is  whether  some  consistent  scheduling  rule  exists 
that  always  leads  to  the  best  full  priority  schedule  pair  when 
employed  at  both  processors. 

Two  full  priority  disciplines  will  be  discussed  in  this 
section:  shortest  burst  time  first  and  relative  shortest  burst 

time  first.  In  the  scheduling  parameter  columns  of  Table  5.3  the 
l‘=^tt?rs  S,  R  and  B  show  that  the  indicated  schedule  pair  was 
-dictated  by  the  SPT  rule,  the  BSBT  rule,  or  both  rules, 
respectively.  These  rules  will  be  defined  shortly.  Since  each 
of  f hem  is  unable  to  decide  upon  a  schedule  for  certain  rate 
values.  Table  8.3  does  not  always  have  an  indication  for  both 
rules  in  a  particular  trial. 
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■^he  conjecture  that  motivated  the  work  in  this  chapter 
extends  the  optical  single-processor  scheduling  rule  by  claiming 
that  scheduling  shortest  burst  time  first  at  each  processor  is 
optimal  with  respect  to  total  processor  utilization  in  a  292- 
system . 

Definition:  '^he  shortest  burst  time  first  (SPT)  scheduling  rule 

assigns  full  preemptive  priority  at  a  processor  to  the 
program  class  with  the  shortest  burst  time  (highest  burst 
rate)  at  tha+  processor. 

'^he  conjecture  is  false  by  counter-examples  from  Table  5.3;  in 
trials  8  and  18  (as  a  pessimist  might  expect)  the  schedule  pair 
specified  by  SBT  is  inferior  to  another  pair,  and  in  trials  2,  6, 
9,  11  ani  15  the  classes  have  identical  rates  on  one  processor 
and  SPT  Drovides  no  way  to  choose  between  them  at  that  processor. 
In  most  of  these  cases,  though,  some  pair  is  clearly  superior. 

"he  two  trials  in  which  the  program  classes  have  identical  burst 
rates  (1  and  3)  are  ignored. 

Pittempts  to  find  a  simple  scheduling  rule  optimal  in  eacn  of 
the  twenty-one  trials  proved  frustratina.  It  was,  however, 
possible  to  define  a  new  rule  that  selects  the  optimal  schedule 
pair  in  each  trial  where  SB"’  failed.. 

D  e''^in  i  t  ion  :  *^10  relative  shortest  burst  time  first  (HSBT) 

discipline  assigns  full  preemptive  priority  at  a  processor  to 
the  program  class  that  spends  the  smallest  percentage  of  its 
time  at  ■'-hat  processor. 

As  usual,  consider  the  rate  matrix: 

Class_1_ _ ^Zi3.ss_2 

Dev.  ^  I  a  b 

Dev.  1  1  c  d 

SB'^  assigns  priority  at  processor  zero  to  program  class  one  if 
and  only  if  a  >  b.  If  a  =  b,  is  unable  to  choose  between  the 

classes.  PSFT  assicns  prior’ •'■v  a-'-  processor  zero  to  program 
class  one  if  and  only  if  ad  >  be.  This  inequality  was  obtained 
by  notina  that  class  one  has  a  relatively  shorter  burst  time  at 
processor  zero  if: 
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1  1 

5  <  P 

T - T  T - T 

a  c  P  +  P’ 

_ c _ < 

a  +  c  P  +  P 

be  +  cd  <  ad  +  cd 

ad  >  be 

In  eaeh  of  the  six  trials  where  the  SBT  rule  either  is  unable 
to  choose  between  program  classes  or  selects  the  wrong  schedule 
D?ir,  selects  the  correct  pair.  Life  is  unfortunately  not 

so  simple,  though:  like  SBT,  occasionally  selects  the 

wrong  pair  (trial  13)  or  is  unable  to  choose  between  program 
classes  (trials  ,  1C,  16  and  21)  . 

The  remainder  of  this  chapter  is  devoted  to  the  exploration 
o"  a  ’’compound"  scheduling  discipline,  SET-PSBT. 

B^^ini-^-ion :  The  shortest  burst  time  f 

fi^st  scheduling  rule  gives  full  preemptive 
nriority  at  a  processor  to  the  same  program  as  the  "better" 
(according  to  total  processor  utilization)  of  the  two  rules 
SBT  and  FSB"’. 

■^or  this  definition  to  be  either  meaningful  or  significant,  it  is 
n==c=>ssary  to  generalize  the  performance  of  SBT  and  PSBT  beyond 
+  rate  sets  of  the  twenty-one  trials.  In  terms  of  general 
"can’t  choose"  situations,  the  rules  are  obviously  complementary: 
If  one  is  unable  to  choose  between  program  classes,  the  other  can 
always  make  a  choice.  ^or  many  sets  of  burst  rates,  SBT  and  FSBT 
lead  to  the  same  schedule.  Occasionally  they  specify  different 
schedules  (or  schedule  pairs,  if  applied  at  both  processors). 

Two  things  must  be  demonstrated:  First,  that  it  is  possible  to  a 
select  the  better  of  the  two  individual  rules  in  the 
situations  where  they  specify  different  schedules.  Second,  that 
•*■10  schedule  pair  specified  by  globally-applied  SBT-PSBT  is 
optimal  with  respect  to  total  processor  utilization  in  a  2P2T 
system. 

Under  what  circumstances,  then,  do  SBT  and  PSBT  specify 
differen-*-  schedule  pairs  when  applied  at  both  processors?  When 
is  one  rule  better  than  the  other  with  respect  to  total  processor 
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utili7ation?  Thp  first  problem  is  to  minimize  the  -number  of 
cases  that  must  be  considered.  Because  of  the  processor  and 
program  class  symmetries  of  the  2P2C  system,  only  the  following 
cases  need  be  treated: 

SBT  dictates:  RSPT  dictates: 


1. 

(If  1] 

!  (1,2) 

2. 

(1,1 

1  (2,2) 

3. 

(1,2 

1  1/1 

4. 

(1,2 

1  2,1 

Cases  two  and  three  can  be  eliminated  by  demonstrating  that  RSBT 
can  never  lead  to  a  schedule  that  aives  priority  to  the  same 
program  class  at  each  processor,  as  follows: 

Consider  the  usual  rate  matrix.  For  RSBT  to  lead  to  the 
schedule  (1,’*')  (where  means  ’’don't  care”)  requires  a/(a+c)  > 

b/(b^-d),  or  ad  >  be.  For  RSBT  to  lead  to  the  schedule  (*,1) 

requires  c/(a+c)  >  d/(b+d),  or  be  >  ad.  The  schedule  (1,1) 
derived  from  the  RSBT  discipline  is  thus  contradictory.  A 
symmetric  argument  leads  to  a  corresponding  contradiction  for  the 
schedule  (2,2) . 

Case  four  can  also  be  eliminated,  by  the  following  argument: 
For  SBT  to  lead  to  •’■he  schedule  pair  (1,2)  requires  a  >  b  and  d  > 
c.  Ror  RSBT  to  dictate  the  schedule  (2,*)  requires  ad  <  be. 

These  two  sets  of  inequalities  are  clearly  contradictory. 

It  is  only  necessary-  to  consider,  then,  the  case  where  SBT 
specifies  the  schedule  (1,1)  and  RSBT  dictates  (1,2).  This 
occurs  when  the  following  inequalities  are  satisfied: 

1  .  a  >  b 

2.  c  >  d 

3.  ad  >  be 

■^hese  •’■hree  inequalities  are  equivalent  to  the  following  single 
statement: 

u .  a/b  >  c/d  >  1 

T {^)  is  sat'sf?ed,  i.e.,  if  S’=*T  and  RSP*^  lead  to  different 
schedules,  w'^en  does  PS'^’’’  vield  hiaher  total  utilization  than 
S’=’T?  '’’o  answer  th^s  trues-'-von,  it  is  nec^^ssary  to  analytically 

solve  ■’^or  total  procf^ssor  utilization  in  terms  of  steady-state 
nroba  bil  i  t  ie-s ,  as  in  Sec-^.ion  5.3  and  Appendix  A. 
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Solutipn  of  the  balance  equations  for  the  schedule  pair 
(1,1),  corresponding  to  SBT,  gives  the  following  steady-state 
probabilities : 


Pr[  (0,1)  1 


Pr[  (1,0)  1 


_  abc _ 

7a + c)"7^^ 

acd 

TaTcrTa5T53+cc[r 


Total  utilization  (defined  to  be  the  average  of  the  utilizations 
of  the  two  processors)  for  this  schedule  pair  equals: 

1  +  ac(b  +  d) 

2  2Ta+crtab+5^+c3r 


^or  the  schedule  pair  (1,2),  corresponding  to  PSBT,  the  steady- 
state  probabilities  are: 


Pr[  (0,1)  1 
Pr[  (4,0)  ] 


0 

ad 

^5  +  ad"  +  cd 


and  the  total  utilization  equals: 

1  +  _ ad _ 

2  2Tab  +  ad  +  cd)" 

Comparing  the  two  values  for  total  utilization,  we  have  R SBT 
superior  to  SBT  exactly  when: 

ac 


in  the  case  where  (from  (4)  above) : 
a/b  >  c/d  >  1 

A.dmittedly  the  ab-ove  formulation  is  somewhat  lacking  in  grace 
and  intuitive  appeal.  Nonetheless,  it  demonstrates  that  the 
conditions  under  which  RSBT  is  superior  to  SBT  can  be  stated 
precisely . 

Tt  is  possible  to  demonstrate  through  inequality  arguments 
that  the  schedule  pair  selected  by  globally-applied  SBT-RSET  is 
optimal.  Since  the  compound  discipline  is  defined  to  select  the 
"better”  of  the  individual  rules  in  each  situation,  and  since 
some  pair  of  full  priority  schedules  must  be  optimal,  it  is 
merely  necessary  to  show  that  for  every  set  of  burst  rates,  any 


full  priority  schedule  pair  not  specified  by  either  of  the  two 
rules  SBT  and  RSPT  is  inferior  to  a  pair  selected  by  one  of  these 
disciplines.  The  optimality  proof  appears  in  Appendix  B. 

One  might  claim  that  a  compound  discipline  such  as  SBT-P.SBT 
is  not  strictly  kosher.  It  should  be  pointed  out  that  not  only 
has  the  search  for  the  best  schedule  pair  been  narrowed  from  an 
infinite  number  (all  preemptive  priority  schedule  pairs)  to  four 
(the  full  priority  pairs)  and  then  to  two  (the  pairs  specified  by 
SBT  and  RSBT  individually),  but  frequently  the  optimal  pair  is 
immediately  and  uniquely  indicated.  This  occurs  in  the  many 
instances  where  globally-applied  SBT  gives  priority  to  a 
different  program  class  at  each  processor,  for  in  this  case  P.SBT 
is  optimal  (Lemma  2  of  Appendix  B)  and  SBT  can  be  guaranteed  to 
specify  the  same  schedule  pair.  Only  when  SBT  selects  the  same 
class  at  each  processor  is  it  necessary  to  resort  to  a  more 
complex  decision-making  process. 

It  is  interesting  to  examine  in  a  particular  case  the 
relative  performance  of  the  four  full  priority  schedule  pairs  as 
the  relationships  of  the  burst  rates  vary.  Figure  5.1  displays 
total  processor  utilization  for  each  pair  for  the  burst  rate 
matrix: 


Class_2 


Class_2 


■TU“ 

d 


Dev,  0  !  TUU 
Dev.  1  !  110 


as  parameter  cl  is  varied  from  a  small  value  to  150,  Initially, 
globally-applied  PSBT  specifies  (2,1)  and  SBT  specifies  (1,1). 

SBT  provides  highest  total  processor  utilization.  At  d  =  11, 

RSBT  is  unable  to  make  a  choice  between  program  classes,  and  as  d 
increases,  P.SBT  specifies  schedule  pair  (1,2).  SBT  continues  to 
yield  highest  utilization  until  roughly  d  =  70,  at  which  point 
RSBT  takes  over.  SBT’s  "can't  choose"  point  occurs  at  3  =  110, 
and  for  higher  values  SBT  and  PSBT  both  specify  (1,2). 

Mahl's  statements  regarding  the  use  of  relative  service  times 
appear  more  reasonable  (although  still  incorrect)  in  the  light  of 
this  data.  His  claim  that  a  rule  that  assigns  top  priority  at 
more  than  one  processor  to  the  same  program  is  sub-optimal  is 
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■Pigure  5 . 1 

consistent  with  the  fact  that  RSBT  cannot  make  such  an 
a  ssignment . 

The  robustness  of  SBT-RSBT  should  be  mentioned.  Figure  5.2 
displays  total  processor  utilization  as  a  function  of  the  two 
scheduling  parameters  for  one  of  the  twenty-one  rate  sets  (trial 
IB).  The  vertical  axis  is  g((0,0),1),  the  priority  of  program 
class  one  at  processor  zero.  The  horizontal  axis  is  g((1r1)»1). 
Utilization  values  are  displayed  at  scheduling  parameter 
intervals  of  0.05. 

Note  that  in  this  particular  trial  there  is  a  very  steep 
gradient  near  the  upper  and  right  edges  (the  edges  opposite  the 
optimal  schedule) .  This  indicates  that  a  very  bad  estimate  of 
burst  rates  that  led  to  setting  qt(0r0),1)  =  0  or  q((1r1)r1)  = 
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would  result  in  relatively  poor  total  processor  utilization. 

This  phenomenon  appears  in  most  trials,  but  becomes  negligible  as 
the  rates  become  close  to  each  other  (i.e.,  as  miscalculation 
becomes  more  likely)  . 

Generalizing  SBT-PSBT  to  systems  of  more  than  two  processors 
is  straightforward.  When  additional  program  classes  are  added, 
tie-breaking  among  classes  with  identical  characteristics  might 
be  a  problem.  In  view  of  the  superiority  of  the  discipline  in 
2P2C  systems,  such  extensions  seem  worth  pursuing. 

The  next  chapter  will  use  the  data  in  Table  5.3  to  study  the 
behavior  of  individual  programs  and  program-oriented  objective 
f  unctions. 
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Prog ram- oriented  objective  functions 


The  previous  chapter  explored  the  performance  of  various 
scheduling  rules  for  systems  of  two  processors  and  two  programs 
with  respect  to  total  processor  utilization.  This  is  a  more 
consistent  and  interesting  measure  than  the  utilization  of  only 
one  processor. 

Objective  functions  that  take  account  of  the  '•quality”  of 
service  afforded  to  programs  of  a  particular  class  are  also  of 
great  practical  interest,  particularly  if  one  views  a  computer 
system  as  an  open  marketplace  in  which  users  bid  against  one 
another  for  limited  resources.  Such  program-oriented  objective 
functions  are  the  subject  of  this  chapter. 

Few  queueing  analyses  concern  themselves  with  program- 
oriented  objective  functions.  Mitrani’s  [ Mitrani  1972b]  is 
probably  the  best  known  attempt,  although  he  makes  some  rather 
grievous  simplifying  assumptions. 


In  the  literature  of  job  shop  scheduling,  both  total  flow 
time  (the  interval  between  the  arrival  of  a  job  and  the 
completion  of  that  job)  and  service  span  (the  interval  between 
the  start  of  processing  on  a  job  and  the  completion  of  that  job) 
are  considered.  The  former  is  probably  the  more  reasonable 
measure  for  the  scheduling  of  batch  computer  systems.  In  systems 
employing  variations  of  round-robin  or  feedback  scheduling,  the 
second  measure  is  very  close  to  the  first  and  is  therefore  of 
interest.  Since  the  basic  queueing  model  of  this  thesis  ignores 
job  arrivals  and  external  queues,  a  measure  of  the  second  type 
will  be  used. 


The  blowup  factor  of  a  program  class,  described  briefly  in 
the  previous  chapter,  is  central  to  the  family  of  program- 
oriented  objective  functions  that  will  be  considered.  In  a 
system  of  two  processors  the  following  definitions  apply: 


Definition:  The  basic  cycle  time  of  a  program  class  is  the  sum 

of  its  expected  burst  times  on  each  processor.  In  other 
words,  it  is  the  expected  time  that  a  program  of  that  class 
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would  require  to  obtain  one  burst  of  service  at  each 
processor  (i.e.,  to  complete  one  cycle)  if  it  were  the  only 
program  in  the  system. 

’^^finition:  '^he  actual  cycle  time  of  a  program  class  is  the 

expected  time  tha+  a  program  belonging  to  that  class  requires 
to  complete  one  cycle  on  the  loaded  system. 

The  actual  cycle  time  of  a  program  class  is  obviously  a  function 
of  its  burst  t^-me  characteristics  (note  that  the  basic  cycle  time 
does  not  provide  complete  information  regarding  these 
characteristics) ,  the  schedule  employed  at  each  processor,  and 
the  burst  time  characteristics  of  the  other  program  class, 

Definition:  The  bi2JiME  £22122  of  ^  program  class  is  the  ratio  of 

its  actual  cycle  time  to  its  basic  cycle  time.  Equivalently 
(since  a  single  program  cannot  simultaneously  use  both 
processors) ,  it  is  the  inverse  of  the  percentage  of  real  time 
durincr  which  the  program  of  that  class  is  receiving  service 
at  some  processor. 

""he  blowup  factor  accurately  reflects  the  ’’quality”  of 
service  rendered:  the  degree  to  which  a  program  of  a  particular 
class  is  serviced  as  soon  as  it  requests  service,  or  the  degree 
to  which  programs  of  other  classes  are  allowed  to  interfere  with 
its  progress.  For  example,  if  a  program  class  has  full 
preemptive  priority  at  every  processor  (i.e.  g(*,<class>)  =  1) 
then  its  blowup  factor  is  unity.  Attempting  to  equalize  blowup 
factors  or  to  minimize  average  blowup  factor  are  both  reasonable 
objective  functions. 

Examination  of  the  tables  in  Chapter  Five  reveals  a  number  of 
interesting  phenomena  that  invite  exploration  and  explanation. 

■^n  a  more  complex  environment  (i.e.,  additional  program  classes 
and  processors)  it  would  be  difficult  to  isolate  the  cause  and 
=>^f='ct  relationships.  The  factorial  nature  of  the  experiments  in 
the  previous  chapter  allows  such  analysis,  however,  which  in  turn 
provides  intuition  that  may  be  useful  when  observing  the  behavior 
o-^  larger  systems.  The  remaining  sections  in  this  chapter  are 
devoted  to  discussions  of  these  phenomena. 
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A.n  examination  of  Table  5.1  reveals  that  in  each  of  t 
■^wenty-one  trials,  practicing  processor  sharing  at  both 
processors  resulted  in  equalising  the  blowup  factors  for 
program  classes.  It  can  be  shown  that  this  will  always  b 
independent  of  the  rate  set,  for  2P2C  systems: 

During  an  arbitrary  time  interval  of  duration  Z,  the 
will  spend  a  certain  amount  of  time  in  each  of  its  four  s 
Recall  that  the  state  designation  (i,j)  indicates  that  th 
program  belonging  to  class  one  is  at  processor  i  and  the 
belonging  to  class  two  is  at  processor  j. 

Let  the  total  time  interval  Z  be  divided  among  the  st 
follows: 

State  Time  in  state 

(0,0)  k 

(0,1}  1 

(1,0|  m 

(l,lj  n 

Obviously  k+l+m+n  =  Z. 

Under  the  processor  sharing  discipline,  each  program 
receiving  service  exactly  one-half  of  the  time  when  both 
are  at  one  processor  (states  C0,0)  and  (1,1)).  Over  the 
interval  Z,  the  class  one  program  receives  service  at  pro 
zero  for  k/2  +  1  time  units  and  at  processor  one  for  n/2 
units.  If  the  rate  matrix. is  taken  to  be  our  old  friend: 

Class_J[ _ Class_2 

Dev.  0  f  a  b 


Dev . 

1  1 

c 

d 

then 

the  class  one 

program  will 

have  a  basic  cycle  time  o 

1/a 

+  1/c 

and 

an  actual  cycl 

e  time 

of : 

Z 

Tc77  +  I  +  n72  +  m 
for  a  blowup  factor  of: 


he 

the  two 
e  true. 


syst  em 
tate  s. 
e 

program 


ates  as 


class  is 
classes 
time 
cessor 
+  m  time 


■F  . 


Zj_  _ 

Y72  +  T  +  n72  +  m 


(1/a  +  1/c) 
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Over  the  same  time  interval ,  the  class  two  program  receives 
service  at  processor  zero  for  k/2  +  m  time  units  and  at  processor 
one  for  n/2  +  1  time  units.  The  class  two  program  thus  has  a 
’^■asic  cycle  time  of: 

1/h  +  l/d 

and  an  actual  cycle  time  of: 

Z  (1/b  ^  1/1) 

lc7?"+“fn~T^n7?~+“r 

for  a  blowup  factor  of: 

Z 

777  +  m  +  n77  +  T 

The  blowup  factors  are  therefore  identical,  independent  of 
rate  set,  when  processor  sharing  is  practiced  at  both  processors 
in  2P2C  system. 

This  behavior  extends  naturally  to  models  with  more  than  two 
processors,  assuming  that  there  are  still  only  two  active 
programs.  It  fails  to  hold  when  additional  program  classes  are 
added;  intuit:  ve]  y,  it  is  possible  in  a  system  of  two  processors 
and  three  program  classes  that  two  of  the  classes  would  always  be 
in  conflict,  while  the  program  belonging  to  the  third  class  runs 
unmolested  and  has  a  blowup  factor  of  unity. 

If  the  objective  is  to  equalize  blowup  factors,  then 
processor  sharing  is  program-measure  optimal.  Since  PS  has  been 
shown  not  to  yield  highest  total  utilization,  it  is  obvious  that 
in  general,  processor-oriented  and  program-oriented  objective 
functions  are  net  necessarily  simultaneously  maximized.  This 
will  be  discussed  further  in  future  sections. 


6.2  Schedules  that  impact  a  Earticular  program  class 

Throughout  Table  5.3  are  instances  in  which  only  one  program 
class  exhibits  an  unusually  high  blowup  factor.  The  otter 
program  class  seems  ’’safe”  from  blowup  even  if  the  schedule  at 
each  processor  is  biased  against  it.  Trials  2,  4,  6,  7,  11^  12, 

13,  14,  17,  18  and  20  are  examples  of  this  phenomenon.  In  other 
trials  (3,  8,  9,  10,  T5,  16,  19  and  21)  either  program  class  may 
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occasionally  suffer  from  an  unusually  high  blowup  factor, 
depending  upon  the  schedules  employed. 

Intuitively,  a  program  will  be  slowed  appreciably  if  it  is 
consistently  denied  the  use  of  at  least  one  processor  when  it 
requests  service  there.  By  considering  all  twenty-one  trials  of 
Table  5.3  it  is  possible  to  formalize  slightly  the  conditions 
under  which  this  ’’lockout”  occurs: 

Program  Class  B  will  be  ’’locked  out"  at  a  processor  iff: 

Proaram  Class  A  has  preemptive  priority  at  that  processor; 
and: 


Class  P.  has  a  much  higher  burst  time  (much  lower  burst 
rate)  at  that  processor  than  Class  B;  or 

Class  A  has  at  least  as  high  a  burst  time  at  that 
processor  as  Class  B,  and  has  a  lower  burst  time  at  th 
other  processor  than  Class  B  has  at  the  first  processo 
or 

Class  has  a  much  lower  burst  time  at  the  other 
processor  than  Class  B  has  at  the  first  processor. 


If  the  following  conditions  are  also  satisfied,  then  the  effect 
will  be  balanced  out  and  lockout  avoided: 

Program  Class  B  has  preemptive  priority  at  the  other 
processor ;  an d 

Class  B  has  at  least  as  high  a  buret  time  at  that  processor 
as  Class  A,  and  has  a  lower  burst  time  at  the  first,  processor 
than  Class  A  has  at  the  other  processor. 


In  the  above  statements,  "much  greater"  means  a  factor  of  10^  and 
"greater"  means  a  factor  of  lO^,  As  a  more  concrete  example, 
consider  yet  again  the  burst  rate  matrix: 


Class  2 


Class  1 


■K 

d 


Dev.  C  1  a 
Dev,  1  !  c 


Bear  in  mind  that  the  following  example  is  expressed  in  terras  of 
burst  rates  rather  than  burst  times: 

’Program  class  two  will  be  "locked  out"  at  processor  zero  iff: 
q((0,C)  ,1)  >  q((0,n)  ,2)  ;  and: 
a  <<  b;  or 
a  <  b  and  c  >  b;  or 
c  >>  b. 

If  the  following  conditions  are  also _ satisfied,  then  the  effect 
will  be  balanced  out  and  lockout  avoided: 

q  (  (1,  1)  ,  1)  <  a((1,  1)  ,2)  ;  and 

d  <  c,  and  b  >  c. 


hi  (0 
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that  these  latter  conditiors  can  only  be  consistent  with  the 
first  of  the  three  earlier  conditions. 

The  above  statements  correspond  closely  to  the  intuitive 
notion  of  what  causes  a  particular  program  class  to  be  locked  out 
at  a  processor.  Assuming  that  the  other  program  class  has 
priority  at  that  processor,  it  may  monopolize  it  in  any  one  of 
several  ways.  It  may  use  the  processor  for  very  long  bursts  (the 
•^irst  condition)  ;  it  may  use  it  for  bursts  of  "reasonable” 
duration  that  occur  fairly  frequently  (the  second  condition) ;  or 
it  may  use  it  for  relatively  short  but  very  frequent  bursts  (the 
rhird  condition) . 

Dne  surprising  and  unexplained  result  is  that  while  the  class 
two  program  does  indeed  suffer  large  blowup  if  a  <  b  and  c  >  b 
(trial  3,  applying  the  device  symmetry),  difficulties  do  not 
arise  if  a  <  b  and  c  >  b  (trial  5,  applying  the  program  class 
symmetry)  . 

The  intuitive  explanation  of  this  condition  is  that  the 
program  of  class  one  spends  at  least  as  much  time  at  processor 
zero  (where  it  has  priority)  as  does  the  program  of  class  two, 
and  finishes  at  processor  one  before  the  class  two  program  is 
done  at  processor  zero,  so  returns  and  preempts  the  class  two 
program.  Logically,  one  would  suspect  that  the  class  one  program 
might  spend  more  time  at  processor  zero  than  the  class  two 
program,  and  finish  at  processor  one  at  least  as  soon  as  the 
class  two  program  finishes  at  processor  zero,  inducing  the  same 
effect.  It  is  possible  that  this  is  in  fact  the  case,  but  that 
the  transition  point  lies  somewhere  between  the  values  computed 
in  the  experiment. 

There  is  one  additional  situation  in  which  a  particular 
program  class  suffers  undue  delays:  if  the  other  class  has  a 
higher  burst  time  at  each  processor  and  also  has  priority  at  each 
processor..  This  also  makes  intuitive  sense. 


6 . 3  general  observa t ions 

Several  miscellaneous  observations  can  be  made  concerning  the 
data  in  the  previous  chapter.  In  the  introduction  to  Chapter 
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Fiv9  it  was  pointed  out  that  when  the  program  classes  have 
identical  burst  time  characteristics  the  choice  of  schedules 
cannot  affect  total  processor  utilization.  Obviously  the  same 
cannot  be  said  for  blowup  factors,  as  trials  1  and  3  in  any  of 
the  tables  indicate. 

?ls  one  might  expect,  practicina  FCFS  at  both  processors 
usually  results  in  a  high  blowup  factor  for  one  of  the  program 
classes  (Table  5.1).  Service  rate  biasing  (Table  5.2)  gives  very 
good  performance  in  most  situations,  but  is  occasionally  poor. 

While  in  general  high  blowup  factors  correspond  to  low  total 
processor  utilization  (trial  2  in  Table  5.3,  for  example), 
certain  schedules  purchase  a  small  increase  in  total  utilization 
at  the  cost  of  a  large  increase  in  the  blowup  factor  of  one 
program  class.  Trial  15  in  Table  5.2  is  an  example  of  this 
phenomenon.  While  the  relationship  between  utilization  and 
blowup  is  not  at  all  clear,  objective  functions  of  the  form 
’’maximize  total  processor  utilization  subject  to  the  constraint 
that  the  ratio  of  the  blowup  factors  be  within  a  certain  bound" 
would  be  interesting  to  investigate. 

Table  5.3  shows  that  in  most  situations  the  better  rule  of 
shortest  burst  time  first  and  relative  shortest  burst  time  first 
performs  excellently  with  respect  to  blowup  factors.  There  are 
two  exceptional  cases:  when  the  program  classes  have  similar 
characteristics  (trials  3  and  15  in  Table  5,3) ;  and  when  one 
processor  has  a  very  light  static  load  (trials  16,  19  and  21) . 

It  is  fair  to  say  that  practicing  SBT-''’SPT  at  both  processors 
provides  good  blowup  performance  along  with  maximum  total 
processor  utilization. 

6  ,  U  When  flullSiZ  ill  B 

Previous  sec’^ions  have  dealt  with  the  burst  times  (or  rates) 
of  program  classes.  The  shortest  burst  time  first  scheduling 
rule  which  has  been  discussed  is  conventionally  called  shortest 
time  first.  The  non-standard  terminology  was  used  in 
order  to  distinguish  SB"’  and  FSBT  from  the  rule  that  will  be 
discussed  in  this  section. 
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Tp  to  this  point  the  number  of  cycles  that  a  program  class 
requires  to  complete  its  processing  has  been  ignored.  The 
objective  functions  that  have  been  considered  are  insensitive  to 
this  information.  'For  the  purposes  of  this  section,  however,  it 
is  assumed  that  in  addition  to  an  expected  burst  time  at  each 
processor,  each  program  class  has  an  expected  number  of  cycles 
that  a  program  of  the  class  must  receive  before  completing 
service  and  being  replaced  by  another  program  of  the  same  class. 

Definition:  In  a  system  of  two  processors,  a  cycle  consists  of 

one  burst  time  (not  one  quantum)  of  service  at  each 
processor.  K  cycle  begins  at  processor  zero  and  ends  at 
processor  one. 

Definition:  The  completion  probability  of  a  program  class  is  the 

probability  that  a  program  of  that  class  will  complete 
service  at  the  end  of  a  cycle.  It  is  the  inverse  of  the 
expected  number  of  cycles  for  that  program  class. 


Notation:  The  completion  probability  of  a  program  class  will  be 

denoted  by  c(<class>). 
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Definition:  The  processing  time  of  a  program  class  is  the 

product  of  its  expected  basic  cycle  time  and  its  expected 
number  of  cycles.  (A.ctually,  it  is  the  mean  of  the 
convolution  o:^  these  two  distributions,  but  this  definition 
will  suffice. ) 


The  scheduler  should  attempt  to  maximize  the 
"collecting”  the  largest  possible  reward. 


objective  function. 


Particularly  when  the  reward  is  uniform  across  program 
classes,  a  conventional  "enlightened"  scheduler  might  give 
priority  at  each  processor  to  the  program  class  with  the  shortest 
processing  tiine  in  an  effort  to  complete  programs  as  rapidly  as 
possible  and  maximize  the  reward. 

Definition:  The  shortest  processing  time  first  (SPT)  discipline 

gives  full  priority  at  a  processor  to  the  program  class  with 
the  shortest  processing  time. 

It  is  obvious  that  from  the  point  of  view  of  total  processor 
utilization,  practicing  SP'^  at  both  processors  might  in  some 
circumstances  select  the  worst  of  the  full  priority  schedule 
pairs.  One  would  suspect  that  in  this  case  SPT  might  result  in 
such  poor  total  processor  utilization  that  global  use  of  SBT-FSBT 
would  be  preferable  even  according  to  the  program-oriented 
objective  function  that  has  been  defined.  This  section  provides 
a  relatively  superficial  exploration  of  this  complex  problem. 

In  the  system  described  above,  a  program  that  has  completed 
its  burst  at  processor  one  moves  back  to  processor  zero  with 
orobability  1  -  c(<class>).  With  probability  c  (<class>)  it 
leaves  the  system,  and  is  replaced  after  some  time  interval  by  a 
program  from  the  same  program  class.  Since  a  closed  cyclic  model 
will  continue  to  be  used,  additional  states  are  introduced  to 
represent  the  transition  between  programs.  Assuming  that  the 
program  replacement  time  is  not  instantaneous.  Figure  6.1  is  the 
■Pull  state-transition  representation  of  such  a  model. 

In  this  representation  the  letter  X  in  a  state  designation 
means  that  the  program  belonging  to  the  indicated  class  has 
completed  and  is  in  the  process  of  being  replaced.  The  "rate" 
u(X,<class>)  indicates  how  rapidly  this  replacement  occurs. 

Fortunately,  life  is  made  a  great  deal  easier  by  the 
assumption  that  programs  that  have  completed  are  instantaneously 
replaced.  In  other  words,  u(X,’^')  is  infinite.  This  assumption 
is  not  totally  unrealistic,  and  it  causes  the  five  "extra"  states 
to  become  superfluous  (i. e.  Prf  (X,C)  ]  =  Pr[  (X,1)  ]  =  Pr[  (D,X)  ]  = 
Prr(1,X)]  =  Prr(X,X)1  =  0),  "figure  6.1  thus  collapses  into  the 
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four-state  representation  of  Figure  4.3  and  the  computation  of 
processing  time  is  handled  by  a  simple  multiplicative  fastor. 

Table  6.1  presents  the  results  of  an  experiment  designed  to 
compare  SPT  to  SBT-RSBT.  Six  trials  were  conducted,  representing 
the  six  non- symmetrical  permutations  of  a  rate  matrix  with  fnur 
distinct  burst  rates.  ‘p’actors  of  two  rather  than  factors  of  100 
were  used.  For  each  rate  set,  the  two  program  classes  take  on 
seven  non-redundan t  pairs  of  cycle  counts,  and  the  relative 
performance  of  SPT  and  SBT-PSBT  practiced  at  both  processors  is 
displayed.  The  better  schedule  (according  to  total  processor 
utilization)  of  SPT  and  RSBT  is  automatically  selected  according 
to  the  criteria  presented  in  Section  5. 4. 

For  each  pair  of  cycle  counts  the  PROCessing  TIME  for  each 
program  class  is  first  shown.  Five  values  are  then  displayed  for 
each  of  the  two  globally- applied  scheduling  rules.  First  is 
total  PROCessor  U'^ILization  .  The  SBT-RSBT  rule  obviously 
dictates  the  same  schedule  for  every  subcase  in  a  particular 
trial,  since  it  depends  only  upon  burst  rates  and  it  is  the 
processing  time  that  is  being  varied.  Therefore  the  utilization 
for  this  rule  remains  constant  throughout  a  trial.  SPT  can 
dictate  only  two  schedules  (since  it  will  give  full  priority  to 
the  same  program  class  at  each  nrocessor) ,  so  only  two  different 
utilizations  occur  for  it  in  each  trial. 

The  REWARD  WULTiplier  for  a  class,  shown  next  in  the  tabl 
indicates  how  many  programs  of  that  class  complete  per  unit  t 
under  the  indicated  scheduling  rule.  To  compute  the  total  re 
when  an  arbitrary  reward  is  declared  for  the  completion  of  a 
program  of  each  class,  it  is  merely  necessary  to  take  the  pro 
of  each  class*  reward  with  that  class'  reward  multiplier  and 
the  two  values. 

finally,  the  TOTAL  RFWA.PD  is  shown  for  two  "reasonable"  ways 
of  computing  the  reward:  a  DNTT  reward  for  the  completion  of  any 
program;  and  a  BIASED  reward,  proportional  to  the  processing  time 
of  the  proaram  cl  ass. 

Consider  trial  2  as  an  example  of  the  behavior  of  the  two 
rules.  Recall  from  Section  5.3  that  the  pair  (i,j)  indicates 
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TRIAL  NO,  1 

RATE  set:  _ 


DEV 

.  01 

] 

2 

DEV 

.  1  1 

4 

8 

NO.  OF 

CYCLESI 

PROC. 

time  I 

1  1 

1  PROC. 

1  reward 

mult, 1 

total 

reward 

£Ul l-.£Lx2 iaLjJL i 

1  SCHEDUl  E  1 

LUIILxl£LtL,l- 

.!:Lj>^_U!illI_LaiAS£2 

1 

1 

1.25 

0,62 

SPT 

60* 

0.168 

1.600 

1.768 

1.210 

sbt-rsbt 

60% 

0,168 

1,600 

1,768 

1.210 

1 

2 

1.25 

1,2b 

SPT 

57% 

0.800 

0.128 

0,928 

1.160 

sbt-rsbt 

60% 

0,168 

0.800 

0.968 

1,210 

1 

4 

1.25 

2,50 

SPT 

57% 

0.800 

0.064 

0.864 

1,160 

SPT-RSBT 

60% 

0,168 

0.400 

0,568 

1,210 

1 

8 

1.25 

5,00 

SPT 

57% 

0,800 

0.032 

0.832 

1.160 

SBT-RSBT 

60% 

0.168 

0.200 

0.368 

1.210 

2 

1 

2.50 

0,62 

SPT 

60% 

0.084 

1,600 

1.684 

1.210 

srt-rsbt 

60% 

0.08^ 

1.600 

1.684 

1.210 

4 

1 

5.00 

0,62 

SDT 

60% 

0,042 

1,600 

1,642 

1.210 

SBT-RSBT 

60% 

0.042 

1.600 

1,642 

1.210 

8 

1 

10.00 

0.62 

SPT 

60% 

0.021 

1.600 

1.621 

1.210 

SBT-RSBT 

60% 

0,021 

1.600 

1,621 

1.210 

trial  no.  2 

RATE  set: 

DEV.  01 
DEV.  11 

NO,  OF  CYCLESIPROC. 
£La1  1  ICL.i  1 

_ !£Lx_a 

4  2 

1  8 

time  I  IPROC. 

CL.2J  SCHEDULE 1 UTIL. 

1  reward 

ICL.l  1 

mult. 

CL. 2 

itotal 

lUfJiT  1 

reward 

BJLASEJi 

1 

1 

1.25 

0.62 

SPT 

69% 

0,320 

1,600 

1.920 

1.400 

srt-rsbt 

83% 

0,666 

1.333 

2,000 

1.666 

1 

2 

1.25 

1,25 

SPT 

62% 

0,800 

0,200 

1.000 

1,250 

srt-rsbt 

83% 

0.666 

0.666 

1.333 

1,666 

1 

4 

1.25 

2.50 

SPT 

62% 

0,800 

0,100 

0,900 

1.250 

srt-rsbt 

83% 

0,666 

0.333 

1.000 

1.666 

1 

8 

1.25 

5,00 

SPT 

62% 

0.800 

0,050 

0.850 

1.250 

srt-rsbt 

83% 

0,666 

0.166 

0,833 

1.666 

2 

1 

2.50 

0.62 

SPT 

69% 

0.160 

1.600 

1.760 

1.400 

srt-rsbt 

83% 

0.333 

1.333 

1,666 

1 . 666 

4 

1 

5.00 

0.62 

SPT 

69% 

0.080 

1,600 

1.680 

1.400 

SBT-RSBT 

83% 

0,166 

1.333 

1,500 

1.666 

8 

1 

10.00 

0.62 

SPT 

69% 

0.040 

1.600 

1,640 

1.400 

SRT-RSBT 

83% 

0.083 

1.333 

1,416 

1.666 

Table  6.1-1 
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TRIAL  NO.  3 

RATE  SET!  ^_CLx-l _ CLx_2. 


DEV 

.  ni 

2 

4 

DEV 

.  1 1 

1 

8 

NO.  OF 

CYCLESI 

PROC. 

time 

1  1 

1  PROC. 

1  reward 

mult. I 

1  total 

reward 

J 

J 

i_eL.21SChEDULEJ.UTIL.  ICL.l  1 

1 

1 

1 .50 

0.37 

SPT 

729<; 

0.296 

2.666 

2.962 

1.444 

SBT-RSBT 

75% 

0.500 

2.000 

2.500 

1.500 

1 

2 

1.50 

0.75 

SPT 

72% 

0.296 

1.333 

1.629 

1.444 

sbt-psbt 

75% 

0.500 

1.000 

1.500 

1.500 

1 

4 

1.50 

1.50 

SPT 

58% 

0.666 

0.111 

0.777 

1.166 

spt-rsbt 

75% 

0.500 

0.500 

1.000 

1.500 

1 

8 

1.50 

3.00 

SPT 

58% 

0.666 

0.055 

0.722 

1.166 

sbt-rsbt 

75% 

0.500 

0.250 

0.750 

1.500 

2 

1 

3.00 

0.37 

SPT 

72% 

0.148 

2.666 

2.814 

1.444 

sbt-rsbt 

75% 

0.250 

2.000 

2.250 

1.500 

4 

1 

6.00 

0.37 

SPT 

72% 

0.074 

2.666 

2.740 

1.444 

sbt-rsbt 

75% 

0.125 

2.000 

2.125 

1.500 

8 

1 

12.00 

0.37 

SPT 

72% 

0.037 

2.666 

2.703 

1.444 

sbt-rsbt 

75% 

0.062 

2.000 

2.062 

1.500 

trial 

NO. 

4 

RATE 

set: 

-LLjl-I- 

_ 

DEV 

.  01 

2 

1 

DEV 

.  11 

4 

8 

NO.  OF  CYCLESI 

PROC. 

TIME  1 

1 

IPROC. 

1  REWARD 

MULT. 1 

1  total 

reward 

Lx  LJ_  C  L  OL  id  - 

i-Ll^dli£il£DUL£iyULxlLLxi-i,CLx£  1  iii^II-JLaiAS£]2 

1 

1 

0.75 

1.12 

SPT 

64% 

1  .333 

0.253 

1.587 

1.285 

SBT-RSBT 

65% 

1.280 

0.320 

1  .600 

1.320 

1 

2 

0.75 

2.25 

SPT 

64% 

1.333 

0.126 

1  .460 

1.285 

spt-rsbt 

65% 

1.280 

0.160 

1.440 

1.320 

1 

4 

0.75 

4.50 

SPT 

64% 

1.333 

0.063 

1.396 

1.285 

SPT-RSBT 

65% 

1.280 

0.080 

1.360 

1.320 

1 

8 

0.75 

9.00 

SPT 

64% 

1.333 

0.031 

1.365 

1.285 

sbt-rsbt 

65% 

1.280 

0.040 

1.320 

1.320 

2 

1 

1.50 

1.12 

SPT 

56% 

0.084 

0.888 

0.973 

1.126 

SPT-RSBT 

65% 

0.640 

0.320 

0.960 

1.320 

4 

1 

3.00 

1.12 

SPT 

56% 

0.042 

0.888 

0.931 

1.126 

sbt-rsbt 

65% 

0.320 

0.320 

0.640 

1.320 

8 

1 

6.00 

1.12 

SPT 

56% 

0.021 

0.888 

0.910 

1.126 

SBT-RSBT 

65% 

0.160 

0.320 

0.480 

1.320 

Table  6.1-2 
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T«IAL  NO,  5 

RATE  set:  _ £Lx_i 


DEV.  01 
OEV,  11 

NO.  OF  CYCLESIPROC. 
CL.  11  CL^_LCL.J.  1 

4 

2 

time  I 

CL.  21 

1 

8 

1 

LSCHEDULi. 

IPROC. 

iUTJ.L. 

IREWARD 
ICL.l  .1 

MULT. 

CL.2 

itotal 

lUNIT 

reward 

•BIASED 

1 

1 

0.75 

1.12 

SPT 

71% 

1  .333 

0.380 

1.714 

1.428 

sbt-rsbt 

80% 

1.230 

0.615 

1.846 

1.615 

1 

2 

0.75 

2.25 

SPT 

71% 

1  .333 

0.190 

1.523 

1.428 

sbt-rsbt 

80% 

1.230 

0.307 

1.538 

1.615 

1 

4 

0.75 

4.50 

SPT 

71% 

1.333 

0.095 

1.428 

1.428 

sbt-rsbt 

80% 

1.230 

0.153 

1.384 

1.615 

1 

8 

0.75 

9,00 

SPT 

71% 

1  .333 

0.047 

1.380 

1.428 

sbt-rsbt 

80% 

1.230 

0.076 

1.307 

1.615 

2 

1 

1  .50 

1,12 

SPT 

59% 

0.126 

0.888 

1.015 

1.190 

SRT-PSBT 

80% 

0,615 

0.615 

1.230 

1.615 

4 

1 

3.00 

1,12 

SPT 

59% 

0,063 

0.888 

0.952 

1.190 

sbt-rsbt 

80% 

0.307 

0.615 

0.923 

1.615 

8 

1 

6.00 

1 .12 

SPT 

59% 

0.031 

0.888 

0.920 

1.190 

sbt-rsbt 

80% 

0,153 

0.615 

0.769 

1.615 

TRIAL  NO.  6 
RATE  set: 

DEV 

DEV 

.  01 

.  1  1 

2  6 

1  4 

NO. 

OF  CYCLESIPROC. 

TIME  1  IPROC. IREWARD 

Lj;u^^HEDyLU.yuuiCLxi-i. 

MULT.  (TOTAL  REWAl^D 
.£L^_mixi_iaiASEC 

1 

1 

1.50 

0,37 

SPT 

sbt-rsbt 

68% 

68% 

0.242 

0.242 

2.666 

2.666 

2.909 

2.909 

1.363 

1.363 

1 

2 

1 .50 

0.75 

SPT 

sbt-rsbt 

68% 

68% 

0.242 

0.242 

1.333 

1.333 

1  .575 
1.575 

1.363 

1.363 

1 

4 

1.50 

1  .50 

SPT 

sbt-rsbt 

57% 

68% 

0.666 

0,242 

0.102 

0.666 

0.769 

0.909 

1.153 

1.363 

1 

8 

1.50 

3.00 

SPT 

sbt-rsbt 

57% 

68% 

0.666 

0.242 

0.051 

0.333 

0.717 

0.575 

1.153 

1.363 

2 

1 

3.00 

0.37 

SPT 

sbt-rsbt 

68% 

68% 

0.121 

0.121 

2.666 

2.666 

2.787 

2.787 

1.363 

1.363 

4 

1 

6.00 

0.37 

SPT 

sbt-rsbt 

68% 

68% 

•• 

0.060 

0.060 

2.666 

2.666 

2.727 

2.727 

1.363 

1.363 

8 

1 

12.00 

0.37 

SPT 

sbt-rsbt 

68% 

68% 

0.030 

0.030 

2.666 

2.666 

2.696 

2.696 

1.363 

1.363 

Table  6. 1 
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that.  proqram  class  i  has  full  preemptive  priority  at  processor 
2ero  (i.e.  c((0,C),i)  =  1)  and  program  class  j  has  full 

preemptive  priority  at  processor  one. 

In  trial  2,  SPT-PSBT  dictates  the  schedule  (1r2) .  The  basic 
cycle  time  of  cLass  one  is  1.250;  the  basic  cycle  time  of  class 
two  is  0.625.  When  class  one  and  class  two  each  cycle  the  same 
number  of  times  before  terminating  (the  first  subcase),  SPT 
dictates  the  schedule  (2,2)  with  a  resultant  decrease  in  both 
utilization  and  reward  compared  to  SBT-PSBT.  Since  SBT-RSBT 
gives  priority  to  a  different  program  class  at  each  processor  in 
trial  2,  SPT  will  always  yield  lower  utilization. 

A.S  the  number  of  cycles  for  program  class  two  is  increased 
(subcases  two  through  four),  the  processing  time  of  class  two 
exceeds  that  of  class  one  and  SPT  dictates  the  schedule  (1,1) . 
SBT-RSBT  produces  a  higher  unit  reward  until  the  fourth  subcase, 
when  the  prpcessing  time  for  class  two  is  four  times  that  of 
class  one.  ^^t  this  point,  SPT  becomes  preferable.  SBT-RSBT 
continues  to  give  better  performance  according  to  the  rate-biased 
reward . 

Subcases  five  through  seven  involve  an  increase  in  the  number 
of  cycles  for  program  class  one.  SPT  dictates  the  schedule  (2,2) 
as  in  subcase  one,  and  gives  better  performance  than  SBT-RSBT 
according  to  the  unit-reward  objective  function. 

The  other  trials  follow  a  similar  pattern.  In  portions  of 
trials  1  and  6  the  two  rules  dictate  the  same  schedule  and 
perform  identically.  Fxcept  for  these  instances,  SBT-RSBT  always 
performs  better  according  to  the  rate-biased  reward  because  it 
minimizes  average  blowup  factor.  SBT-RSBT  performs  better 
according  to  the  unit-reward  objective  function  until  one  program 
class'  processing  time  is  larger  than  the  other's  by  a 
substantial  factor  (e.g.,  h  in  trial  2). 

fairly  strong  conclusion  may  be  drawn  from  this  preliminary 
data:  that  givinc  priority  to  the  program  class  with  the 
shortest  processing  time,  while  ignoring  burst  times  and 
processor  utilization,  is  a  dangerous  policy  even  according  to 
program-oriented  objective  functions. 
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Conclusions  and  extensions 
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This  thesis  has  attempted  to  establish  firm  intuitive  and 
semi-formal  foundations  for  studying  the  effi^cient  scheduling  of 
computer  systems  with  multiple  scarce  resources.  These 
foundations  were  laid  by  a  thorough  exploration  of  the  scheduling 
phenomena  that  arise  in  systems  with  two  processors  and  two 
active  programs. 

The  stochastic  model  used  in  this  analysis  presumes  that 
successive  burst  intervals  for  a  particular  program  class  at  a 
particular  processor  may  be  selected  from-  the  same  exponential 
distribution.  Although  little  empirical  evidence  exists 
r^^garding  this  assumption,  it  is  probably  not  correct.  The 
accuracy  of  the  assumption  is  not  critical  to  the  study  of 
program  interaction  effects,  but  certainly  has  a  bearing  on 
optimality  results. 

The  first-come-first- served,  processoir  sharing,  and  service 
rate  biasing  scheduling  disciplines  were  first  compared  to  one 
another  numerically  with  respect  to  total  processor  utilization. 
In  general,  the  rule  making  the  most  use  of  burst  time 
information  (SEB)  performed  best. 

From  a  practical  standpoint,  scheduling  rules  that  reguire 
preemption  no  more  freguently  than  when  a  program  requests 
service  at  a  processor  are  desirable.  Processor  sharing  and 
service  rate  biasing  do  not  satisfy  this  criterion.  The  full 
priority  disciplines,  which  meet  this  objective,  were  defined. 

A  proof  was  presented  that  some  pair  of  full  priority 
schedules  is  sufficient  to  achieve  maximum  total  processar 
u-*-ilization  in  a  2P2C  system.  An  effort  was  then  made  to  find  a 
consistent  scheduling  rule  that  would  generate  the  optimal  full 
priority  schedule  pair  when  used,  globally.  The  conjecture  that 
scheduling  shortest  burst  time  first  at  each  processor  would 
suffice  was  shown  to  be  false.  The  specific  sets  of  proaram 
burst  rates  for  vrhich  the  rule  failed  had  been  previously  noted 
as  producing  anomalous  behavior  with  ^CFS,  PS  and  SRB.  It  was 


-93 


not  possible  to  state  precisely  the  conditions  under  which  th 
behavior  arose,  however. 

A  new  rule,  relative  shortest  burst  time  first,  was  defin 
While  PSBT  suffers  from  the  same  shortcomings  as  SBT,  it  was 
proven  that  the  union  of  the  two  disciplines,  a  rule  that  sel 
the  "better"  of  the  schedule  pairs  specified  by  the  two  globa 
applied  rules,  is  optimal  with  respect  to  total  processor 
utilization. 


The  various  scheduling  rules  were  next  compared  with  resp 
to  a  program-oriented  objective  function,  program  class  run  t 
expansion  factor  ("blowup"  factor) .  It  was  shown  that  proces 
sharing  always  equalizes  the  blowup  factors  of  the  two  progra 
classes.  The  SBT-FSBT  rule  was  seen  to  perform  very  well 
according  to  this  criterion.  Conditions  were  stated  under  wh 
the  blowup  factor  of  a  particular  program  class  becomes 
excessive.  These  conditions  serve  to  formalize  the  ways  in  w 
various  schedules  allow  one  program  class  to  interfere  with  t 
progress  of  another. 

Finally,  it  was  presumed  that  the  programs  of  each  class 
cycle  finitely  in  the  system,  and  the  objectiv^e  function  was 
changed  to  one  based  upon  a  "reward"  offered  for  the  completi 
o:<^  each  program.  A  somewhat  surprising  result  was  shown:  th 
even  with  respect  to  this  objective  function  it  is  very  often 
preferable  to  schedule  according  to  burst  times  rather  than  t 
processing  times. 

A  natural  extension  would  be  the  application  of  these 
insights  to  more  complex  environments.  A  jump  from  two  to  th 
processors  and  active  programs  would  probably  yield  significa 
results;  the  costs  in  terms  of  both  analysis  and  general 
confusion  of  going  beyond  that  point  probably  outweigh  the 
benefits. 


The  easiest  extension  from  the  point  of  view  of  comprehen 
would  be  to  add  a  new  program  class  with  characteristics 
identical  to  those  of  one  of  the  two  existing  classes.  Next, 
characteristics  of  this  third  class  could  be  varied  so  that  t 
system  contains  three  distinct  program  classes. 
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Tt  is  also  simple  to  a^d  additional  processors  to  the  model, 
’^^n-lerson  (Section  3.*^)  considers  a  CPU  and  three  peripheral 
processors,  and  includes  peripheral  processor  selection 
probabilities  for  each  program  class.  The  combinatorics  of  a 
factorial  experiment  are  already  out  of  hand,  and  without  some 
intuitive  grasp  of  program  interactions,  confusion  reigns.  It  is 
in  understanding  this  sort  of  complex  behavior  that  the  results 
of  this  thesis  should  be  of  value. 

In  addition  to  the  above  extensions,  several  related  issues 
come  to  mind  that  should  be  explored.  In  particular: 

1.  A^ccurate  burst  time  information  is  needed  by  the  SBI-RSBI 
discipline.  How  could  this  information  be  obtained  in  an  actual 
svstem?  Pmpirical  data  such  as  that  presented  in  [ Sherman  1972b] 
must  be  analvxed  to  see  if  programs  tend  to  exhibit  serial  burst 
time  correlation  on  various  processors.  Could  a  ’’smart'*  compiler 
accurately  and  inexpensively  predict  object  program  burst  tines? 

2.  t  is  clear  that  scheduling  rules  that  rely  on  burst  time 
information  can  yield  better  performance  than  less  "intelligent” 
rules.  It  is  also  clear  that  the  performance  of  these  rules  is 
dependent  upon  the  accuracy  of  the  burst  time  information  that  is 
supplied.  This  relationship  needs  to  be  quantifi-ed  in  order  to 
indicate  the  robnisdiness  of  the  various  rules.  How  inaccurate  can 
burst  time  information  (however  derived)  be  before  the 
performance  of  a  particular  sophisticated  rule  falls  below  that 
of  a  simple  discipline? 

3.  While  it  is  important  to  distinguish  between  burst  time  and 
processing  time.  Section  6.5  shows  that  both  of  these  factors  are 
of  importance  to  some  objective  functions.  Can  a  tradeoff 
be+woen  the  two  be  arrived  at  in  terms  of  weight  attached  and 
accuracy  of  information? 

4.  '^he  relationship  between  total  processor  utilization  and 
program  blowup  factors  is  not  at  all  clear  from  the  data  in  this 
thesis.  Objective  functions  of  the  form  "maximize  total 
processor  utilization  subject  to  the  constraint  that  the 
the  blowup  factors  be  within  a  certain  bound"  would  be 
interesting  to  investigate. 


ratio  of 
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[Abell  1970] 

V.A,  A.bell,  S,  Rosen  and  R.F.  wagner,  "Scheduling  in  a 
General  Purpose  Operating  System,"  Proc*  1970  FJCC,  pp.  39- 
96. 

Pescribes  f^ACF,  a  system  for  the  CDC  6500.  Up  to  26 
control  points  are  provided,  versus  8  under  the  standard 
Chippewa  sys-*:em.  Automatic  priority-based  ro llin/rollout  is 
provided.  It  appears  that  the  small  number  of  control  points 
and  their  non-preemptible  nature  were  the  undoing  of 
Chippewa. 


rA.nderson  1973  ] 

H.A.  Anderson,  Jr.,  "Approximating  Preemptive  Priority 
Dispatching  in  a  Multiprogramming  Model,"  IBM  J.  Res. 
Develop.  17,6,  pp .  533-539. 

A  closed  queueing  model  is  developed  that  consists  of  a 
central  processor,  U  distinguishable  programs  and  M 
peripheral  processors.  Each  program  has  its  own  central 
processor  service  rate,  peripheral  processor  service  rates, 
and  peripheral  processor  selection  probability  function. 
Priority  at  various  resources  is  established  by  probability 
functions  and  solution  is  through  Chandy's  local  balance 
t  echnique . 

The  approach  shows  promise  of  being  applicable  to 
realistic  systems  and  interesting  scheduling  rules. 


[Baskett  1971] 

F.  Baskett,  "Mathematical  Models  of  Multiprogrammed  Compjter 
Systems,"  TSM-17,  Univ.  of  Texas  at  Austin  Computation 
Center,  January  1971. 

An  excellent  study  of  a  series  of  queueing  models,  each 
of  which  includes  some  of  the  following  characteristics: 
hy poexponentially  distributed  I/O  service  times; 
hyperexponent  ially  distributed  CPU  service  times;  'p'CFS  or  RR 
CPU  scheduling;  distinguishable  peripheral  processors. 
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r  Baskett  1972  1 

F.  Baskett,  ''The  Fependence  of  Computer  System  Queues  Upon 
Processing  Time  Distribution  and  Central  Processor 
Scheduling,"  STGOPS  6,1S2,  pp.  109-113. 

By  using  the  method  of  stages  to  model  complex 
distributions,  finite-  and  infinite-source  processor  sharing 
models  are  compared  to  a  FCFS  discipline.  Schrage’s 
conjecture  is  confirmed;  With  coefficient  of  variation  less 
^han  one,  W  (FCFS)  is  less  than  W  (PS)  ;  coefficient  of 
variation  greater  than  one  reverses  the  situation. 


r  ^ern stein  1971] 

A..J.  Bernstein  and  J.C.  Sharp,  "A.  Policy-Driven  Scheduler  for 
a  Time-Sharing  System,"  CA.CM  14,2,  pp.  74-78. 

A  General  Flectr ic-modif ied  scheduler  that  attempts  to 
minimize  overhead  is  described.  A  process  is  defined  as 
critical  if  its  deadline  is  approaching.  Wo  non-critical 
process  may  preempt  a  critical  process.  Varying  degrees  of 
hysteresis  may  be  specified  as  bias  in  favor  of  the  executing 
process. 

r^hat  1973  ] 

U.N.  Bhat  and  P.F.  Nance,  "Dynamic  Quantum  Allocation  and 
Swap-Time  Variability  in  Time-Sharing  Operating  Systems,"  CP- 
73009,  Southern  Nethodist  Univ.  ,  A.pril  1973. 

"Fffective  C^U  utilization"  is  defined  to  be  the 
proportion  of  CPU  busy-period  devoted  to  problem  program 
processing;  idle  time  is  ignored.  A  Markov  model  is  used  to 
demonstrate  that  a  policy  of  decreasing  guantum  length  with 
increasing  load  provides  improved  utilization. 

The  definitions  and  notation  conceal  the  fact  that  CPU 
utilization  in  a  conventional  sense  is  quite  low  (below  5Q%) 

^ or  those  cases  where  a  high  rate  of  preemption  is  found 
d  esirable . 


rBrinch  Hans'en  1o72] 

P.  Erinch  Hansen,  "Short-Term  Scheduling  in  Multiprogramming 
Systems,"  STGOPS  f,1S2,  pp.  101-105. 

Defines  the  process  control  primitives  used  in  the  PC- 
4000  system,  and  claims  that  their  execution  is  rapid  enough 
that  they  car.  be  used  for  medium-term  scheduling. 
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r  BrowP-P  19721 

J.C.  Prowne,  J.  Lan  and  F.  Baskett,  ’’The  Interaction,  of 
Multiprogramming  Job  Scheduling  and  CPU  Scheduling,"  Proc, 
1972  FJCC,  pp.  13-21. 

Using  a  probabilistic  simulation  model  of  the  UT-1  and 
UT-2  systems  for  the  CDC  6600,  the  authors  explore 
interactions  between  job  scheduling  (or  macro-scheduling)  and 
CPU  scheduling  upon  several  objective  functions. 

SPT  job  scheduling  and  PP  processor  scheduling  are  found 
best  by  a  factor  of  two  for  throughput  (number  of  jobs 
completed  in  a  fixed  time  interval) .  Since  the  job  mix  is 
heavily  CPU-bound,-  the  relatively  slight  effect  of  processor 
scheduling  upon  the  chosen  objective  function  is  not  terribly 
surprising. 

Scheduling  the  processor  shortest  burst  time  first  is 
found  to  prpvide  the  best  CPU  utilisation;  PR  provides  the 
best  I/O  utilisation. 


[Bunt  19741 

R.B.  Bunt,  "Self-Pegulating  Schedulers  for  Operating 
Systems,"  Ph.P,  Thesis,  Univ.  of  Toronto,  1974  (in 
preparation) . 

A  discussion  of  adaptive  schedulers  that  react  by 
modifying  the  scheduling  discipline  as  load  varies. 
Simulation  results  are  presented  for  a  single-processor 
system,  that  uses  variations  of  a  round-robin  strategy. 


[ Buzen  1 97 1 j 

J.P.  Buzen,  "Queueing  Network  Models  of  Multiprogramming," 
ESD-TP-71-345 ,  Harvard  Univ.,  August  1971. 

Develops  the  "central  server"  model  of  multiprogramming. 
Any  number  of  different  peripheral  processors  are  accounted 
for.  All  programs  must  have  the  same  characteristics, 
however. 

The  model  is  applied  to  three  problems:  optimal  buffer 
size;  PPU  utilization;  and  page  traffic  balancing. 

Contains  an  excellent  bibliography  and  survey  of 
queueing  theory  as  applied  to  computer  systems. 


rcodd  1960] 

E.F.  Codd ,  "Multiprogram  Scheduling,"  CACM  3,6S7,  pp.  347-350 
S  413-418. 

In  this  ancient  article,  placement  rules  are  proposed 
for  multiprogramming  systems  with  all  jobs  initially 
available. 
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^ Coffman  1968] 

F.G.  Coffman,  Jr.,  and  L.  Kleinrock,  "Computer  Scheduling 
Methods  and  their  Countermeasures,"  Proc.  1963  SJCC,  pp,  11- 
21. 

A  general  survey  of  single-resource  scheduling 
disciplines.  The  proposed  countermeasures  are  generally 
obviours;  how  to  prevent  their  use  is  considerably  more 
complex. 


r  Cof  f  man  1973  ] 

^.G.  Coffman,  Jr.,  and 
Theory,  Frentice-Pall , 


P.J.  Denning, 
1973. 


Systems 


This  recent  book  represents  the  first  attempt  at 
comprehensive  treatment  of  formal  methods  for  the  ana 
computer  systems.  Aifter  a  general  discussion  of  cone 
processes,  both  deterministic  and  probabilistic  model 
central  processor  scheduling  are  studied.  Rotating  d 
scheduling  is  discussed,  and  there  is  a  brief  section 
models  of  T/0  -  CPP  interaction.  Denning's  influence 
in  the  two  chapters  devoted  to  memory  management.  Th 
goes  a  Iona  way  towards  filling  the  need  for  a  compre 
text  on  compudier  system  scheduling  and  formal  systems 
analysis.  A.  certain  amount  of '  thoroughness  is  necess 
sacrificed  for  the  sake  of  this  comprehensiveness,  ho 


a 

lysis  of 
urrent 
s  of 
evice 
on 

is  felt 
is  book 
hensive 

arily 
wever . 


r  Conway  1 967  ] 

P.W.  Conway,  V.L.  Maxwell  and  L.W.  Miller,  Theory  of 
P-ddison-Wesley ,  1967, 

The  fundamental  reference  work  on  job  shop  scheduling. 


r  Denning  1968  ] 

P.J.  Denning,  "Resource  ?^..llocat ion  in  Multiprocess  Computer 
Systems,"  M?\C-TP-50,  MIT,  May  1968. 


a 

g 

h 

a 
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This  grandfather  of  the  working  set  model  papers 
iscusses  various  aspects  of  maintaining  system  balance.  The 
eneral  scheduling  method  proposed  is  to  "macro-select"  jobs 
ased  upon  memory  contention  criteria,  then  sub-select 
ccording  to  processing  demands.  The  algorithm  is  shown  to 
e  isomorphic  to  the  multidimensional  knapsack  problem. 


r  Denning  1 97C ] 

P.J.  Denning,  "Virtual  Memory,"  Comp.  Surveys  2,3,  pp.  153- 

1P0. 

A.  -^airly  comprehensive  survey,  including  sections  on 
working  set  scheduling  principles.  Much  of  multiple- resource 
scheduling  stems  from  attempts  to  cope  effectively  with 
virtual  memory. 


[ Di jk stra  1959  ] 

F.W.  Dijkstra,  "Communication  with  an  Automatic  Computer," 
Ph.D.  Thesis,  Univ.  of  Amsterdam,  October  1959. 

"De  XI  is  een  automatische  rekenmachine,  die  ontworpen 
is  met  het  doel  voor  ogen  een  machine  te  construeren,  die 
zowel  voor  wetenschappeli jk  als  voor  admin istratief  rekenwerk 
goed  gebruikt  kan  worden." 

This  thesis  is  of  absolutely  no  relevance  to  the  subject 
of  this  bibliography,  but  I  was  told  that  people  check  for 
Dijkstra  references  when  perusing  a  thesis  in  the  systems 
area.  I  guess  st  least  some  people  do! 


r  Doherty  1970  ] 

W.J.  Doherty,  "Scheduling  TSS/360  for  Responsiveness,"  Proc. 
1970  FJCC,  pp.  97-112. 


An  in-bouse  version  of  TSS  was  modified  to  establish  a 
time  slice  inversely  proportional  to  working  set  size.  TSS 
is  of  special  interest  because  of  its  table-driven  scheduler. 


rnamlet  1973] 

R , G.  Hamlet,  "Ffficient  Multiprogramming  Resource  Allocation 
and  Accounting,"  CA.CM  16,6,  pp.  337-343. 

Distinguishes  between  time-sharing  and  multiprogramming 
by  claiming  that  the  former  attempts  to  equalize  or  maximize 
"response",  while  the  latter  attempts  to  maximize  resource 
utilization.  Points  out  the  importance  of  distinguishing 
between  short-term  and  long-term  scheduling  when  considering 
utilization. 


[ lavender  1968] 

J.W.  Favender,  "^.voiding  Deadlock  in  Multitasking  Systems," 
IPM  Sys.  J.  '^,2,  pp.  7a-84. 

CS/360  M7T  employed  a  multi-level  resource  allocation 
scheme,  "nesting"  allocations  so  that  those  resources 
allocated  first  would  be  released  last. 


[ Hellerman  1969] 

H.  Hellerman,  "Some  Principles  of  Time-Sharing  Scheduler 
Strategies,"  IBM  Sys.  J.  8,2,  pp.  94-117. 

Proposes  as  a  scheduler  metric  the  harmonic  mean  of  the 
ratio  of  stand-alone  program  execution  time  to  program  flow 
time. 
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r  Hellerraan  1 9*70  ] 

B.  Hellerman  and  F.J.  Smith,  "Throughput  Analysis  of  Some 
Idealized  Input,  Output  and  Compute  Overlap  Configurations," 
Comp.  Surveys  2,2,  pp,  111-118. 

Fguations  are  presented  that  describe  the  maximum 
attainable  throughput  with  a  single  job  class  and  one  of  the 
following  overlap  configurations: 

1.  no  overlap 

2.  input  and  output  overlapped  singly  with  computing 

3.  (2)  plus  input  overlapped  with  output 
ti.  full  3-way  overlap. 


rnillier  1967  ] 

F.S.  Hillier  and  G.J.  Lieberman,  Introduction  to  Operations 
Research,  Holden-Day,  1967. 

Includes  a  review  of  probability  theory  and  a 
substantial  section  on  probabilistic  modelling. 


r  Karp  1972] 

P.W.  Karp,  "P educibil ity  Among  Combinatorial  Problems,"  in 
Complexity  of  Computer  Computations,  R.F.  Miller  and  J.W, 
Thatcher,  eds.  ,  Plenum  Press,  1972. 


Discusses  the  implications  of  polynomial  completeness 
for  combinatorial  problems.  Included  in  this  class  is  the 
multidimensional  Knapsack  problem  which  is  isomorphic  to 
optimal  multiple-resource  scheduling. 


rKleinrock  1970a] 

L.  Kleinrock,  "A  Continuum  of 
Algorithms,"  Proc.  1970  SJCC, 

The  priority  of  a  progra 
of  two  parameters: 

a  =  priority  slope  while 
b  =  priority  slope  while 
By  varying  these  parameters  a 
from  FCFS  to  RR  is  described. 
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rKleinrock  1970b] 

L.  Kleinrock,  "Swap-Time  Considerations  in  Ti 
Systenrs,."  IFFF  T  on  C  19,6,  pp.  534-540. 

The  expected  swap  time  as  a  function  of 
time  and  number  of  users  is  found  for  systems 
representing  time-shared  and  processor- shared 


me-Shared 

mean  service 
of  queues 
systems . 
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r  Klein  rock  19*721 

L.  Kleinrock  and  P.P.  Kuntz,  ’’Processor  Sharing  Oueueing 
Kodels  of  Kixed  Scheduling  Disciplines  for  Time-Shared 
Systems^”  JA.CF!  19,3,  pp.  464-482. 

Typically,  PP  and  '^Fn  are  the  processor  sharing 
disciplines  studied;  the  objective  function  is  T(t)  --  time 
i.n  system  conditioned  upon  service  time  for  each  program. 
This  paper  studies  an  inf inite- leveled  PP  system  where  the 
service  discipline  at  each  level  may  be  PC^s,  FB,  or  PP. 


r  La mpson  1 96 8  1 

B.W.  T.ampson,  ”A  Scheduling  Philosophy  for  Multiprocessing 
Systems,”  CACM  11,9,  pp.  347-360. 

A  general  and  introductory  paper  dealing  with  process 
communication  and  synchronization. 


rian  1971] 

J.C.  Ian,  "A  Study  of  Job  Scheduling  and  Its  Interaction  with 
CPU  Scheduling,"  TSh^-24,  Univ.  of  Texas  at  Austin  Cnmputation 
Center,  December  1971. 

""his  K.sc.  thesis  represents  the  research  From  which 
'■  Frowne  1^72  1  was  derived,  A  simulation  model  of  the  CDC 
6600  /  UT- 1  sys'^'em  reveals  that  shortest  "^ob  first  memory 
schedulinq  coupled  with  round-robin  central  processar 
schedulina  yields  highest  +--hr ough put.  The  round-robin 
alaori+hm  is  found  especially  valuable  when  service  time 
d  is'^r  ibut  ion  s  are  highly  shewed. 


riasser  1969] 

D.J.  Lasser,  ’'Product ivity  of  Mult iprogrammed  Computers: 
Progress  in  Develooina  an  Analytic  Prediction  Method,”  CACM 
12,12,  pp.  678-6P4. 

Attempts  to  predict  the  progress  rates  of  various 
mult iproarammed  nobs  as  a  function  of  load;  a  non-simulation 
prediction  method  ^or  jobs  with  cyclic  demands. 


^ Laue  sen  1  973  ] 

5.  Lauesen,  "Job  Scheduling  Guaranteeing  Peasonable  Turn- 
Around  Times,"  Acta  Informatica  2,1,  pp.  1-11. 

The  BOSS  2  system  for  the  PC-4C00  is  described. 

Prioritv  is  computed  as  a  linear  function  of  a  loss  constant 
times  the  ratio  of  +otal  flow  time  and  estimated  execution 
time,  until  some  maximum  time  limit  is  exceeded.  At  this 
ucint  a  large  constant  is  inserted.  Besources  are  allocated 
using  the  Banker’s  Algorithm. 
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r  Lester  19*73] 

P.A.  I.ester,  *’The  Tnvestigat ior.  of  Service  Time 
Distributions,"  T.P.  25,  Computer  Systems  Research  Sroup, 
rjniv,  of  Toronto,  April  1973. 


A  number  of  improved 
require  information  about 
questions  are  considered: 
come  closest  to  approxima 
observed  (in  a  particular 
supplied  information  shou 
accurately  partition  jobs 


scheduling  rules  (SPT,  SR,  etc.) 
the  service  tiroes  of  jobs.  Two 
What  theoretical  distributions 
ting  the  actual  distributions 
environment)?  Upon  what  user- 
id  one  condition  in  order  to 
by  service  time? 


[Mahl  1970] 

R.  Mahl,  "An  Analytical  Approach  to  Computer  Systems 
Scheduling,"  CSc- 70-100,  Univ.  of  Utah,  June  1970. 

Proposes  that  users  be  required  to  furnish  scheduliig 
information  so  that  the  supervisor  may  function  as  follows: 

Given  Aij,  a  matrix  indicating  how  great  a  percentage  of 
its  time  job  i  will  make  use  of  resource  j,  and  Ci,  a  vector 
of  external  priorities  for  the  jobs  i,  determine  Pij,  the 
integer  priorities  of  the  jobs  on  the  resources,  and  Wi,  the 
progress  rates  of  the  jobs,  so  as  to  maximize  some  economic 
function. 

A  confusing  presentation. 


[Marshall  1969] 

B.S.  Marshall,  "Dynamic  Calculation  of  Dispatching  Priorities 
Under  OS/360-MVT,"  Datamation  15,8,  pp.  93-97. 

This  modification  to  OS/MVT  reorders  task  control  blocks 
every  five  seconds  to  correspond  to  the  following  priority 
scheme:  P  =  (T/0  time  *  100) /(I/O  time  +  CPU  time).  Limited 

benchmark  testing  showed  a  mean  throughput  improvement  of 
19*^. 


r  McKinney  1969  ] 

J.M.  McKinney,  "A  Survey  of  Analytical  Time  Sharing  Models," 
Comp.  Surveys  1,2,  pp.  105-116. 

A  good,  brief  survey  of  a  large  number  of  approaches. 
Includes  an  excellent  bibliography. 


r  Metzger  1972  ] 

J.K.  Metzger,  Course  Motes,  CSC  2206,  "Systems  Analysis," 
Univ,  of  Toronto,  1'^72. 


-103- 


r  Witrani  1 972a  1 

I.  flitrani,  "A  Queueing  Model  of  Priority  Multiprogramming," 
T.B.  41,  Univ.  of  Newcastle  upon  Tyne,  December  1972. 

This  paper  is  concerned  with  analysing  the  circumstances 
under  which  preemptive  priority  at  an  I/O  processor  is 
superior  to  non-preemptive  priority  at  that  processar.  The 
model  has  two  processors  and  N  distinct  programs  that 
circulate  continuously  in  the  system. 


[ Mitrani  1 972b  ] 

I.  Mitrani,  "Non-Priority  Multiprogramming  Systems  Under 
Heavy  Demand  Conditions  -  Customers’  Viewpoint,"  JACM  19,3, 
pp.  445-452. 

A  cyclic  gueueing  model  of  a  two-processor  system  is 
considered,^  where  FCFS  is  practiced  at  each  processor.  The 
author  attempts  to  determine  the  optimal  degree  of 
multiprogramming  from  the  viewpoint  o'f  a  program-oriented 
objective  function. 


r  Mullery  1970  ] 

A.D.  Mullery  and  G.C.  Driscoll,  "A  Processor  Allocation 
Method  for  Time-Sharing,"  CACM  13,1,  pp,  10-14. 

When  your  system  is  less  than  fully  loaded,  i.e.  when 
you're  not  concerned  with  overhead,  this  algorithm  will 
provide  a  decreased  number  of  preemptions  and  consequent 
savings  in  supervisor  time.  Alas  . . . 


r  Munck  1971 ] 

P.G.  Munck,  "A  Table-Driven  Scheduler  for  Widely  Diverse 
Requirements,"  abstract  in  Proc.  1971  IEEE  International 
Computer  Society  Conference,  pp.  183-184, 

IBM's  CP-67  was  modified  to  include  slot  sorting,  a  pra- 
paging-out  mechanism,  and  a  table- driven  scheduler  that 
responds  to  the  CPU,  I/O  and  paging  demands  of  each  user 
according  to  system  load. 


[ Muntz  1972] 

R.R,  Muntz  and  F,  Baskett,  "Open,  Closed  and  Mixed  Networks 
of  Queues  with  Different  Classes  of  Customers,"  T,R.  33, 
Stanford  Digital  Systems  Laboratories,  August  1972. 

Chandy's  concept  of  "local  balance"  is  used  to  solve 
models  that  include  multiple  job  classes  and  four  different 
service  disciplines:  FCFS,  processor  sharing,  no  queueing, 
and  LCFS.  At  the  latter  three  types  of  service  stations, 
service  time  distributions  need  only  have  rational  Laplace 
transforms. 
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At  the  completion  of  a  service  request,  a  program  may 
shange  both  class  and  service  station  according  to  transition 
probabilities.  Tt  is  shown  that  this  facility  is 
sufficiently  powerful  to  allow  models  with  other  than 
exponential  service  time  distributions  to  be  transformed  into 
nearly  equivalent  models  with  only  exponential  distributions. 

[Nielsen  1 967  3 

N.R.  Nielsen,  "The  Simulation  of  Time  Sharing  Systems,”  CACM 
10,7,  pp.  397-412. 

A  FORTRAN  simulation  of  TSS  on  the  /360-67  prior  to  its 
introduction.  Various  bottlenecks  and  difficulties  were 
pinpointed . 


[ Nielsen  1971] 

N.R.  Nielsen, 
CACH  14,2,  pp 


"An  Analysis  of  Some  Time  Sharing  Techniques,” 
79-90. 
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memory  compaction  prior  to 
e  disk  queueing  mechanisms. 


r  Omahen  1 972 ] 

K.  Omahen  and  L.F.  Schrage,  "A  Queueing  Analysis  of  a 
Multiprocessor  System  with  Shared  Memory,"  Pros.  Symposium  on 
Computer  Communications  Networks  and  Teletraffic,  Polytechnic 
Institute  of  Prooklynr  April  1972. 

A  readable  summary  of  the  results  of  Omahen's  Ph.D. 
thesis  ([Omahen  1973  ]). 


[ Omah  en  1 973 ] 


K.  Omahen,  "A.nalyt 
Ph.D.  Thesis,  Univ 
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[ Price  19721 

T.G.  Price,  ''An  Analysis  of  Central  Processor  Scheduling  in 
Multiprog  rammed  Computer  Systems,"  T.P.  57,  Stanford  Digital 
Systems  Laboratories,  October  1972. 

Proves  the  not-t oo-surprising  result  that  SRPT  is 
optimal  vith  respect  to  total  processor  utilization  in  a  two 
processor  system  when  one  of  the  processors  has  as  nany 
servers  as  there  are  programs;  i.e.  queueing  occurs  at  only 
one  processor. 


[ Fodriguez-Rosell  19721 

J.  Rodriguez-Rosel 1 ,  "Experimental  Data  on  How  Program 
Behavior  Affects  the  Choice  of  Scheduler  Parameters,"  SISOP5 
6,1S2,  pp.  156-163. 

The  "system  operating  point"  is  defined  as  the  point  in 
the  multidimensional  resource  space  that  corresponds  to  the 
instantaneous  utilization  levels.  It  is  desirable  to 
maintain  this  point  within  certain  tolerances.  Data  from 
interpretive  program  traces  is  analyzed  to  determine  whether 
the  working  set  size  of  programs  may  be  estimated  accurately 
enough  to  allow  effective  operating  point  control  in  the 
(processor, memory)  space. 


[  Rod r iguez-Posell  1973  1 

J.  Rodriguez-Rosell  and  J.-P.  Dupuy,  "The  Design, 
Implementation  and  Evaluation  of  a  Working  Set  Dispatcher," 
CACH  16,4,  pp.  247-253. 

In  a  practical  application  of  the  "operating  point" 
philosophy,  CP-67  was  modified  in  a  production  environment  to 
include  a  working  set  size  estimation  procedure  and  dispatch- 
criterion. 


r  Ryder  19701 

K.D.  Ryder,  "A  Heuristic  Approach  to  Task  Dispatching,"  IBM 
Sys.  J,  9,3,  pp.  189-198. 

Describes  the  subgroup  dispatching  algorithm  available 
in  VS-2  Release  2  and  later  MVT  systems.  Within  a  specified 
subgroup  of  programs,  priority  is  determined  as  a  functicn  of 
I/O  activity  —  higher  priority  to  more  I/O-bound  tasks. 

Benchmark  result-s  are  provided  for  three  jobstreams  on 
three  machines. 


[Scherr  19671 

A.L.  Scherr,  An  Analysis  of  Time-Shared  Computer  Systems,  MIT 
Press,  1967, 

One  of  the  first  attempts  to  simulate  the  behavior  of  a 
time-sharing  system,  and  still  one  of  the  best.  Scherr 
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conducts  exten sive  validations,  including  detailed  empiri cal 
modelling  of  the  behavior  of  the  CT?S  user  community. 


r  Schrage  1 968  1 

L.E.  Schrage,  "A  Proof  of  the  Optimality  of  the  SRPT 
Discipline,"  Op.  Pes.  16,3,  pp.  687-690. 

SPPT  is  shown  optimal  by  the  criterion  of  number  of 
programs  remaining  in  system,  regardless  of  the  arrival  or 
processing  time  distributions.  An  interchange  argument  is 
used  to  show  that  any  schedule  that  is  non-SRPT  over  an 
interval  of  time  may  be  improved  by  practicing  the  SRPT 
discipline  over  that  interval. 


r  Schrage  1969  ] 

L.E.  Schrage,  "Analysis  and  Optimization  of  a  Queueing  Model 
of  a  Real-Time  Computer  Control  System,"  IEEE  T  on  C  18,11, 
pp.  997-1003. 

Programs  of  a  particular  class  may  have  three  types  of 
priority  over  programs  of  a  lower  class:  preemptive-resume, 
preemptive-restart  and  non-preemptive .  The  type  of  priority 
that  one  class  has  over  another  should,  in  many  cases,  change 
as  a  function  of  time.  The  optimal  selection  of  thresholds 
for  these  priority  types  is  discussed  for  various  preemption 
cost  values. 


[ Schrage  1974] 

L.E.  Schrage,  "Optimal  Scheduling  Rules  for  a  Single 
Processor,"  abstract  in  Proc.  Seventh  Hawaii  International 
Conference  on  System  Sciences,  January  1974. 

Optimal  scheduling  rules  and  their  proofs  are  presented 
for  the  cases  of  zero  or  infinite  preemption  cost  and  full, 
probabilistic  or  no  service  time  information.  A.n  excellent 
review  of  current  results. 


rsevcik  1972a] 

K.C.  Sevcik,  "The  Use  of  Service  Time  Distributions  in 
Scheduling,"  T.R.  14,  Computer  Systems  Research  Group,  Univ. 
of  Toronto,  May  1972. 

The  smallest  rank  (=  expected  investment  /  payoff) 
discipline  is  shown  to  be  optimal  with  free  preemption, 
linear  losses  and  no  arrivals.  SR  is  a  generalization  of  the 
various  rules  that  had  previously  been  shown  optimal  in 
subcases  of  the  above  general  case. 

Under  linear  losses,  Poisson  arrivals  and  known  service 
time  distributions,  SP  is  shown  optimal  among  selection 
r'ules.  It  is  conjectured  that  a  selection  rule  is  sufficient 
in  this  case. 
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rsevcik  1972bl 

K.C.  Sevcik,  ’’Optimll  Processor  Scheduling  When  Service  Times 
A.re  Hyperexponentially  Distributed  and  Preemption  Overhead  is 
Not  Negligible,"  Proc.  Symposium  on  Computer  Communications 
Networks  and  Teletraffic,  Polytechnic  Institute  of  Brooklyn, 
A.pril  1972. 

The  smallest  rank  (SR)  discipline,  which  specializes  to 
PS  for  HP  service  time  distributions,  minimizes  average  time 
in  system  when  preemption  is  free.  In  the  case  of  non- 
negligible  preemption  cost,  it  is  shown  that  the  value  to  the 
schedule  of  each  additional  preemption  is  monotone 
decreasing.  The  limited  PR  discipline  (a  fixed-size  RR  queue 
from  which  users  drop  into  a  FCFS  "background"  queue)  is 
shown  to  be  near-optimal  for  a  particular  case. 


[ Sevcik  1974] 

K.C.  Sevcik,  "Computer  System  Modelling  and  Analysis: 
Assessing  Some  Common  Assumptions,"  abstract  in  Proc,  Seventh 
Hawaii  International  Conference  on  System  Sciences,  January 
1974. 

This  paper  discusses  the  validity  and  effect  of  various 
distribution  assumptions  in  central  processor  models  of 
computing  systems.  The  following  assumptions  are  discussed: 
exponential  interarrival  and  service  times;  zero  preemption 
overhead;  identically  distributed  service  times;  independence 
of  successive  service  intervals;  and  stationarity  of 
d istributions. 


r  Sherman  1 972a  ] 

S.W,  Sherman,  "Trace-Driven  Modeling  Studies  of  the 
Performance  of  Computer  Systems,"  TSN-30,  Univ.  of  Texas  at 
Austin  Computation  Center,  August  1972. 

The  primary  goal  of  this  thesis  is  to  exhibit  the  power 
and  advantages  of  trace-driven  modeling.  Descriptions  are 
provided  of  the  DT-1  and  UT-2  systems  for  the  CDC  6600. 
Parameters  of  the  model  are  discussed.  The  following 
problems  are  examined:  CPU  scheduling  (discussed  more  fully 
in  [Sherman  1972b]) ;  cost  of  deadlock  algorithms;  effect  of 
decreasing  the  degree  of  multiprogramming;  analysis  of 
proposed  hardware  changes;  channel  scheduling;  response  time. 


r  Sher ma n  1 972b ] 

S.W.  Sherman,  F.  Paskett  and  J.C.  Browne,  "Trace-Driven 
Modelling  and  Analysis  of  CPU  Scheduling  in  a 
Multiprogramming  System,"  CACM  15,12,  pp.  1063-1069. 

A  number  of  schedules,  from  theoretically  optimal  to 
random,  are  tried  on  a  simulator  of  the  CDC  6600.  The 
following  predictors  of  service  time  are  used:  precise 
advance  knowledge,  varying  degrees  of  exponential  smoothing. 
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complete  job  history,  random  guess.  Both  preemptive  and  npn- 
preemptive  rules  are  considered. 


[Stevens  1968] 

D.F.  Stevens,  ’*0n  Overcoming  High-Priority  Paralysis  in 
Multiprogramming  Systems:  P.  Case  History,'*  CACM  11,8,  pp. 
539-541. 

A  modification  of  the  CDC  6600/Chippewa  system  to 
increase  throughput.  Instead  of  assigning  control  points  on 
a  fixed-priority  basis,  the  highest-priority  job  that  will 
fit  in  memory  gets  th^  nod.  The  priority  of  I/O-bound 
executing  jobs  is  artificially  hiked.  Rough  measurements  are 
presented  to  substantiate  improvement  claims. 


[ Varney  1972 ] 

R.C.  Varney,  ’’Process  Selection  in  a  Hierarchical  Operating 
System,”  SIGOPS  6,182,  pp.  106-108. 

A  scheduling  strategy  is  proposed  for  systems  that  are 
represented  as  hierarchical  trees  of  processes.  The 
’’schedule”  is  an  algorithm  according  to  which  the  tree  is 
traversed.  The  strategy  is  altered  by  changing  the  traversal 
algorithm  or  by  rearranging  the  descendants  of  a  particular 
node. 


r 


White  1964] 

P.  White,  ’’Relative  Effects  of  Central  Processor  and 
Output  Speeds  Upon  Throughput  on  the  Large  Computer,” 
7,12,  pp.  711-714. 

The  duration  of  I/O-CPU  overlap  periods  is  somet 
CPU-sensitive  and  sometimes  I/O-sensitive,  in  the 
uniprogramming  environment  considered  here.  A  hardwa 
monitoring  method  is  described  that  isolates  the  two 
and  allows  prediction  of  expected  performance  improve 
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Appendix  A:  A  proof  that  some  pair  of  full  prioritj^  schedules  is 
sufficient  to  achieve  maximum  total  processor  utilization 

It  was  shown  in  Section  5,3  that,,  in  a  system  of  twD 
processors  and  two  program  classes,  a  pair  of  full  priority 
schedules  is  sufficient  to  achieve  maximum  total  processor 
utilization  if  this  objective  function  is  monotone  in  each  of  the 
two  scheduling  parameters.  The  following  proof  of  monotonicity 
is  due  largely  to  the  efforts  of  Michael  Bauer. 

Theorem:  In  a  2P2C  system,  total  processor  utilization  is 

monotone  in  each  of  the  two  scheduling  parameters. 

Outline  of  proof:  A  function  will  be  defined  and  sufficient 
conditions  for  its  monotonicity  stated  and  proven.  It  will  then 
be  shown  that  the  expression  for  total  processor  utilization  has 
the  form  of  this  function,  and  it  will  be  proven  that  it 
satisfies  the  mionotonicit y  conditions. 


Lemma  _1’  b®t  F(x)  =  (Ax  +  B)/(Cx  +  D)  ,  where  A,  B,  C  and  D  are 
constants,  and  (Cx  +  D)  has  no  zero  for  any  x  in  [0,1].  Then  F 
is  monotone  in  x  in  that  interval. 

E.I.22f  •  The  first  derivative  of  F  (x)  is: 

AD  -  BC 

7Cx“+"rTy7 

which  is  finite  and  has  the  same  sign  for  all  x  in  [0,1]. 

OED  Lenma  2* 
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Lemma  2:  Total  processor  utilization  can  be  written  in  the  form 

Ak  +  B1  +  Ckl  +  D 
T!]c“+~?T~T-Ulfr“T-H 

where  k  and  1  are  the  two  scheduling  parameters,  and  A  through  H 
are  constants  composed  -of  the  four  rate  matrix  entries. 

Proof:  As  explained  in  Section  5.3,  total  processor  utilization 

is  equal  to  (1  +  Pr[  (0,1)  ]  +  Pr[  (1,0)  ]  )/2.  Maximizing  this 

function  is  equivalent  to  maximizing  the  sum  of  these  two  steady 

state  probabilities.  Considering:  the  rate  matrix: 

Class  1  Class  2 
Dev.  0  I  a  b 

Dev.  1  I  c  d 

ani  letting  q((0,0),1)  be  represented  by  k  and  q((1,1),1)  by  1, 
the  following  are  the  solutions  for  the  two  steady  state 
probabilities  of  interest: 

Denominator  for  both  Prf  (0,1)  ]  and  Pr[  (1,0)  ]: 

a^dkl  +  ab2k  +  abck  +  abdl  +  acdl  +  aa2kl  +  b2ckl  +  bc2kl  + 
bcdk  +  cd2l  -  a2bk  -  a2dk  -  ab2  -  abc  -  abckl  -  abd  - 
abdkl  -  acd  -  acdkl  -  ad2k  -  b2cl  -  bc21  -  bed  -  bcdkl 
c2dl  -  cd2 

Numerator  for  Pr[  (0,1)  ]: 

b2ckl  +  bc2kl  +  bcdk  +  bcdl  -  abckl  -  b2cl  -  bc21  -  bed  - 
bcdkl 

Numerator  for  Prf  (1,0) ]: 

a2dkl  +  abdk  +  abdl  +  ad2kl  -  a2dk  -  abd  -  abdkl  -  acdkl  - 
ad  2  k 

Terms  may  be  grouped  so  that  the  function  has  the  form  of  this 
lemma,  with: 


A 

— 

bed  + 

abd  - 

a2  d 

— 

ad2 

B 

= 

bed  - 

b2c  - 

bc2 

+ 

abd 

= 

b2c  + 

bc2  - 

abc 

- 

bed  +  a2d  + 

ad2  - 

abd  -  acd 

D 

= 

-  bed 

-  abd 

•p 

= 

ab2  + 

abc  + 

bed 

- 

a2b  -  a2d  - 

ad2 

p 

= 

abd  + 

acd  + 

cd2 

- 

b2c  -  bc2  - 

c2d 

G 

= 

a2d  + 

ad2  + 

b2c 

+ 

bc2  -  abc  - 

abd  - 

acd  -  bed 

H 

= 

-  ab2 

-  abc 

-  abd 

-  acd  -  bed 

-  cd2 

ODD  Lemma  2,. 
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Lerama  3:  The  denominator  of  the  total  processor  utilization 
function,  Ek  +  Fl  +  Gkl  +  H,  is  non-zero  for  k  and  1  in  [0,1]. 

Proof:  Since  the  rates,  a,  b,  c  and  d,  are  constrained  to  be 

positive,  H  is  always  negative  (Lemma  2)  and  the  entire 
expression  is  negative  for  k  =  1  =  0.  Do  there  exist  values  of  k 
and  1  in  [0,1]  for  which  the  expression  is  non-negative?  Only  if 
Ek  +  El  +  Gkl  >  -  H.  It  must  be  shown  that  Ek  +  El  +  Gkl  <  -  H 
for  all  k  and  1  in  [0,1]. 


Substituting  for  F,  E, 
results  in  the  followin 
ab2k  +  abc(k  -  kl) 

+  cd21  -  (a2b 
+  ad2  +  b^c  + 
Because  k  and  1  are  in 
satisfied : 


G  and  H  from  Lemma  2  and  collecting  terms 
g  form  for  the  above  inequality: 

+  abd(l  -  kl)  +  acd  (1  -  kl)  +  bcd(k  -  kl) 

+  a2d  +  ad2)k  -  (b2c  +  bc2  +  c2d)l  +  (a2d 
bc2) kl  <  ab2  +  abc  +  abd  +  acd  +  bed  +  cd^ 
[0,1],  the  following  six  inequalities  are 


ab^k  <  ab2 


abc(k  -  kl)  <  abc 
a bd  (1  -  kl)  <  abd 
acd{l  -  kl)  <  acd 
bed (k  -  kl)  <  bed 
cd21  <  cd2 

Note  that  the  first  of  these  inequalities  is  <  for  k  <  1,  while 
the  sixth  is  <  for  1  <”  1.  When  k  and  1  are  both  1,  then  the 
other  four  inequalities  are  all  <.  Proof  of  the  lemma,  which 
requires  that  the  inequality  be  <,  is  now  reduced  to  showing 


that : 

(a2d  +  ad2  +  b2c  +  bc2)kl  <  (a2b  +  a2d  +  ad2)  k  +  (b2c  +  bc2  + 
c2d)  1 


But 

(a2d  +  ad2)kl  <  (a2d  +  ad2)  k 

and 


(b2c  +  bc2)  kl  <  {b2c  +  bc2)l 
and  since 

a2bk  +  c2dl  >  0 
the  lemma  is  proven. 
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Lernira  4:  Total  processor  utilization  is  monotone  in  k,  or 
q((0,C),1). 

Proof:  Fix  1,  the  other  scheduling  parameter,  at  some  value  1* 

in  [P,1].  Pearranging  terms  from  Lemma  2,  utilizatit)n  equals: 
(A  +  CIM  k  +  (Bl'  +  D) 

This  function  has  the  form  of  the  function  in  Lemma  1,  and  by 
Lemma  3,  the  denominator  is  non-zero  for  k  and  1'  in  [0,1]. 

QFP  lamina  4. 

Lemma  5:  Total  processor  utilization  is  monotone  in  1,  or 

q((1r  1)  r  1)  . 

other  scheduling  parameter,  at  some  value  k' 
in  [0,1].  Rearranging  terms  from  Lemma  2,  utilization  equals: 
(P  +  CkMl  +  (Ak*  +  P) 

This  function  has  the  form  of  the  function  in  Lemma  1,  and  by 
Lemma  3,  the  deiromi-na t or  is  non-zero  for  k'  and  1  in  [0,1]. 


It  has  been  shown  that  total  processor  utilization  in  a  2P2C 
system  is  monotone  in  each  of  the  two  scheduling  parameters,  and 
consequently  that  some  pair  of  full  priority  schedules  is 
sufficient  to  achieve  m>aximum  total  processor  utilization 
a  system. 


in  such 


A-ppendix  B:  A  proof  of  the  optimality  of  the  SBT-RSBT  ^iscip^line 


Section  5.3  and  A.ppendix  A  are  devoted  to  a  proof  that  some 
pair  of  full  priority  schedules  is  optimal  with  respect  to  total 
processor  utilization  in  a  2P2C  system.  Section  5.4  defines  the 
SBT-RSBT  scheduling  rule,  a  "compound"  discipline  equivalent  to 
the  "better"  of  the  individual  rules  SBT  and  RSBT  in  every 
situation,  and  claims  that  it  is  optimal  when  applied  at  both 
processors.  This  section  contains  the  optimality  proof. 

Theorem:  In  a  2P2C  system,  the  schedule  pair  selected  by 

globally-applied  SBT-PSBT  is  superior  to  all  other  full  priority 
schedule  pairs  with  respect  to  total  processor  utilization,  and 
is  therefore  optimal. 

2f  E£.22f'  case  analysis  is  employed.  The  number  of 
cases  that  must  be  considered  is  reduced  by  taking  advantage  of 
the  symmetries  of  the  2P2C  system,  as  described  in  Sections  5.1 
and  5.4.  In  a  particular  case,  all  that  must  be  shown  is  that 
for  the  allowable  burst  rate  values,  any  full  priority  schedule 
pair  not  specified  by  one  of  the  two  rules  SBT  and  RSBT  is 
inferior  to  a  pair  specified  by  one  of  these  rules. 


Lemma  1_:  Recall  that  the  notation  (irj)  in  reference  to  full 
priority  schedule  pairs  indicates  that  the  class  i  program  has 
full  preemptive  priority  at  processor  zero  and  the  class  j 
program  is  favored,  at  processor  one.  For  the  standard  burst  rate 
matrix,  the  expressions  that  follow  represent  total  processor 
utilization  for  the  four  pairs  of  full  priority  schedules: 

(1,2)  : 


(1.1) :  ac (b+dy 

Ta+cfTabTbdTW 

(2.1) :  be 

abTbc+cd 


(2,2)  : 


ad 

abTa^Tcd 

bd  (a^-c) 

TS+dflabTacTcdr 


E£22£  *  expressions  were  obtained  by  substituting  the 

appropriate  values  for  k  and  1  (the  scheduling  parameters)  iu  the 
total  utilization  function  appearing  in  Lemma  2  of  Appendix  A. 
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L®;rma  2:  When  SPT  applied  globally  either  gives  priority  to  a 
different  program  class  at  each  processor  or  is  unable  to  choose 
at  one  processor,  the  schedule  pair  specified  by  globally-applied 
PSBT  is  superior  to  any  other  full  priority  pair. 

Proof:  Because  of  symmetry,  only  one  of  the  two  SBT-specif ied 

schedule  pairs  (1,2)  and  (2,1)  must  be  considered.  Treating  the 
former  (and  allovring  for  the  "can*t  choose"  boundary  condition) 
requires  a  >  b  and  d  >  c.  If  a  =  b  and  d  =  c  then  the  schedule 

pair  has  no  effect  upon  the  objective  function,  so  ad  >  be  and 

^SPT  specifies  (1,2).  There  are  three  cases: 

1*  (”1^2)  is  superior  to  (2,1)  .  From  Lemma  1,  this 

requires  verifying  that: 

_  _ad _  >  _  _bc  _ 

ab+ad+cd”  aF+bcTcd 

Expanding  and  recombining  yields: 

ad  (ab+cd)  +  abed  >  bc(ab+cd)  +  abed 

ad  >  be 

which  is  true  by  the  conditions  of  this  lemma. 

2ase  2:  (1,2)  is  superior  to  (1,1).  From  Lemma  1,  this 

requires  verifyina  that: 


ad 


> 


ab+ad+cd  "(s+cyTab+bd+cd)" 

Expanding  and  recombining  yields: 

(ad  +  cd)  (ab+bd+cd)  >  (bc+cd)  (ab  +  ad  +  cd) 
ad^  +  cd^  +  a^d  >  acd  +  c^d  +  abc 
ad  (d-c)  +  cd  (d-c)  +  a(ad-bc)  >  0 

which  is  true  by  the  conditions  of  this  lemma. 

2ase  3:  (1,2)  is  superior  to  (2,2)  .  "^rom  Lemma  1,  this 

requires  verifying  that: 

_  _ad _ >  _  bd_(a  +  c) _ 

ab+ad  +  cd  Tb+dy'(a'5+ ac+cdf 

Expanding  and  recombining  yields: 

(ab+ad)  (ab+ac  +  cd)  >  (ab+bc)  (ab+ad  +  cd) 

a^bc  +  a^cd  +  acd^  >  ab^c  +  abed  +  bc^d 

abc(a-b)  +  acd  (a-b)  +  cd  (ad-bc)  >  0 

which  is  true  by  the  conditions  of  this  lemma. 

The  schedule  pair  specified  by  global  P.SPT  is  therefore  superior 
to  any  other  full  priority  pair  whenever  global  SBT  either  gives 
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priority  to  a  different  program  class  at  each  processor  or  is 
unable  to  choose  at  one  processor.  Note  that  if  the  boundary 
condition  is  excluded,  SBT  and  PSBT  specify  the  same  schedule 
pair, 

^ED  Lemjia  2. 


Lemma  3:  When  SBT  applied  globally  gives  priority  to  the  same 
program  class  at  each  processor,  the  schedule  pair  specified  by 
globally-applied  SBT-PSBT  is  superior  to  any  other  full  priority 
pair. 

Since  PSBT  can  never  give  priority  to  the  same  class  at 
each  processor  (Section  5.4),  there  are  four  cases: 


SBT  RSBT 


1 

2 

3 

4 


Because  of  symmetry,  cases  three  and  four  are  subsumed  by  cases 
one  and  two  respectively,  and  case  two  is  subsumed  by  case  one. 
Proof  of  the  lemma  is  reduced  to  showing  that  when  SBT  specifies 

(1,1)  and  PSBT  specifies  (1,2)  (i.e,,  when  a  >  b,  c  >  d,  and  ad  > 

be) ,  each  of  the  two  other  full  priority  pairs  is  inferior  to  one 
of  these. 

1*  When  SBT  specifies  (1,1)  and  PSBT  specifies  (1,2), 

(1.1)  is  superior  to  (2,1).  Prom  Lemma  1,  this  requires 

verifying  that: 

ac(b  +  d)_  /  >  be 

TaTcriab'FbdPccir  aBTBcPcd 

Expanding  and  recombining  yields: 

(a2 b+abc+acd)  (b+d)  >  (ab^+b^d+bed)  (a+c) 

a^b  +  abc  +  acd  >  ab^  +  b^c  +  bc^ 

ab(a-b)  +  be  (a-b)  +  c(ad-bc)  >  0 

which  is  true  by  the  conditions  of  this  lemma. 

2*  1''hen  SBT  specifies  (1,1)  and  PSBT  specifies  (1,2) 

(1.2)  is  superior  to  (2,2).  Prom  Lemma  1,  this  requires 

verifying  that: 

ad  >  bd(a+c) 

abTadTcS  -(bT^rTabTaCPcSr 


9 
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Expanding  and  recombining  yields: 

(a^b+a^c  +  acd)  (b+d)  >  (ab^  +  abd+bcd)  (a-«-c) 
a2b  +  a^d  +  ad^  >  ab^  +  abd  +  bed 
ab(a-b)  +  ad  (a-b)  +  d(ctd-bc)  >  0 

which  is  true  by  the  conditions  of  this  lemma. 

The  SBT-RSBT  schedule  pair  is  therefore  superior  to  any  other 
full  priority  pair  when  SBT  gives  priority  to  the  same  program 
class  at  e-ach  processor. 

^ED  Lemma  3. 


Globally-applied  SBT  must  either  give  priority  to  the  same 
program  class  at  each  processor,  give  priority  to  a  different 
program  class  at  each  processor,  or  be  unable  to  choose.  For 
each  of  these  cases,  it  has  been  shown  that  any  full  priority 
schedule  pair  not  specified  by  one  of  the  two  rules  SBT  and  RSBT 
is  always  inferior  to  a  pair  specified  by  one  of  these  rules  with 
respect  to  total  processor  utilization.  Since  some  full  priority 
pair  must  be  optimal  by  the  theorem  in  Appendix  A,  SBT-RSBT  is  an 
optimal  scheduling  rule  for  a  2P2C  system. 
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